This is the code repo for paper CREAM: Consistency Regularized Self-Rewarding Language Models accepted to ICLR 2025. CREAM extends the Self-Rewarding Language Model (SRLM) to small models (e.g., ...
Fig. 1. Overall Architecture. Our framework extracts spatial and dynamic features in parallel using the Slow and Fast pathways. Bi-directional Feature Fusion (BFF) facilitates the exchange of rich ...
Abstract: Federated Learning is a machine learning methodology that emphasizes data privacy, involving minimal interaction with each other’s systems, primarily exchanging model parameters. However, ...