diff --git a/current/2024-12-23 B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners.yaml b/archive/238/2024-12-23+B-STaR%3A+Monitoring+and+Balancing+Exploration+and+Exploitation+in+Self-Taught+Reasoners.yaml
similarity index 100%
rename from current/2024-12-23 B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners.yaml
rename to archive/238/2024-12-23+B-STaR%3A+Monitoring+and+Balancing+Exploration+and+Exploitation+in+Self-Taught+Reasoners.yaml
diff --git a/current/2024-12-23 Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching.yaml b/archive/238/2024-12-23+Distilled+Decoding+1%3A+One-step+Sampling+of+Image+Auto-regressive+Models+with+Flow+Matching.yaml
similarity index 100%
rename from current/2024-12-23 Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching.yaml
rename to archive/238/2024-12-23+Distilled+Decoding+1%3A+One-step+Sampling+of+Image+Auto-regressive+Models+with+Flow+Matching.yaml
diff --git a/current/2024-12-23 Diving into Self-Evolving Training for Multimodal Reasoning.yaml b/archive/238/2024-12-23+Diving+into+Self-Evolving+Training+for+Multimodal+Reasoning.yaml
similarity index 100%
rename from current/2024-12-23 Diving into Self-Evolving Training for Multimodal Reasoning.yaml
rename to archive/238/2024-12-23+Diving+into+Self-Evolving+Training+for+Multimodal+Reasoning.yaml
diff --git a/current/2024-12-23 NILE: Internal Consistency Alignment in Large Language Models.yaml b/archive/238/2024-12-23+NILE%3A+Internal+Consistency+Alignment+in+Large+Language+Models.yaml
similarity index 100%
rename from current/2024-12-23 NILE: Internal Consistency Alignment in Large Language Models.yaml
rename to archive/238/2024-12-23+NILE%3A+Internal+Consistency+Alignment+in+Large+Language+Models.yaml
diff --git a/current/2024-12-23 Revisiting In-Context Learning with Long Context Language Models.yaml b/archive/238/2024-12-23+Revisiting+In-Context+Learning+with+Long+Context+Language+Models.yaml
similarity index 100%
rename from current/2024-12-23 Revisiting In-Context Learning with Long Context Language Models.yaml
rename to archive/238/2024-12-23+Revisiting+In-Context+Learning+with+Long+Context+Language+Models.yaml
diff --git a/current/2024-12-23 RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response.yaml b/archive/238/2024-12-23+RobustFT%3A+Robust+Supervised+Fine-tuning+for+Large+Language+Models+under+Noisy+Response.yaml
similarity index 100%
rename from current/2024-12-23 RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response.yaml
rename to archive/238/2024-12-23+RobustFT%3A+Robust+Supervised+Fine-tuning+for+Large+Language+Models+under+Noisy+Response.yaml
diff --git a/current/2024-12-24 Agent-SafetyBench: Evaluating the Safety of LLM Agents.yaml b/archive/238/2024-12-24+Agent-SafetyBench%3A+Evaluating+the+Safety+of+LLM+Agents.yaml
similarity index 100%
rename from current/2024-12-24 Agent-SafetyBench: Evaluating the Safety of LLM Agents.yaml
rename to archive/238/2024-12-24+Agent-SafetyBench%3A+Evaluating+the+Safety+of+LLM+Agents.yaml
diff --git a/current/2024-12-24 DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought.yaml b/archive/238/2024-12-24+DRT-o1%3A+Optimized+Deep+Reasoning+Translation+via+Long+Chain-of-Thought.yaml
similarity index 100%
rename from current/2024-12-24 DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought.yaml
rename to archive/238/2024-12-24+DRT-o1%3A+Optimized+Deep+Reasoning+Translation+via+Long+Chain-of-Thought.yaml
diff --git a/current/2024-12-24 Deliberation in Latent Space via Differentiable Cache Augmentation.yaml b/archive/238/2024-12-24+Deliberation+in+Latent+Space+via+Differentiable+Cache+Augmentation.yaml
similarity index 100%
rename from current/2024-12-24 Deliberation in Latent Space via Differentiable Cache Augmentation.yaml
rename to archive/238/2024-12-24+Deliberation+in+Latent+Space+via+Differentiable+Cache+Augmentation.yaml
diff --git a/current/2024-12-24 Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding.yaml b/archive/238/2024-12-24+Friends-MMC%3A+A+Dataset+for+Multi-modal+Multi-party+Conversation+Understanding.yaml
similarity index 100%
rename from current/2024-12-24 Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding.yaml
rename to archive/238/2024-12-24+Friends-MMC%3A+A+Dataset+for+Multi-modal+Multi-party+Conversation+Understanding.yaml
diff --git a/current/2024-12-24 Large Motion Video Autoencoding with Cross-modal Video VAE.yaml b/archive/238/2024-12-24+Large+Motion+Video+Autoencoding+with+Cross-modal+Video+VAE.yaml
similarity index 100%
rename from current/2024-12-24 Large Motion Video Autoencoding with Cross-modal Video VAE.yaml
rename to archive/238/2024-12-24+Large+Motion+Video+Autoencoding+with+Cross-modal+Video+VAE.yaml
diff --git a/current/2024-12-24 LearnLM: Improving Gemini for Learning.yaml b/archive/238/2024-12-24+LearnLM%3A+Improving+Gemini+for+Learning.yaml
similarity index 100%
rename from current/2024-12-24 LearnLM: Improving Gemini for Learning.yaml
rename to archive/238/2024-12-24+LearnLM%3A+Improving+Gemini+for+Learning.yaml
diff --git a/current/2024-12-24 OpenAI o1 System Card.yaml b/archive/238/2024-12-24+OpenAI+o1+System+Card.yaml
similarity index 100%
rename from current/2024-12-24 OpenAI o1 System Card.yaml
rename to archive/238/2024-12-24+OpenAI+o1+System+Card.yaml
diff --git a/current/2024-12-24 OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning.yaml b/archive/238/2024-12-24+OpenRFT%3A+Adapting+Reasoning+Foundation+Model+for+Domain-specific+Tasks+with+Reinforcement+Fine-Tuning.yaml
similarity index 100%
rename from current/2024-12-24 OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning.yaml
rename to archive/238/2024-12-24+OpenRFT%3A+Adapting+Reasoning+Foundation+Model+for+Domain-specific+Tasks+with+Reinforcement+Fine-Tuning.yaml
diff --git a/current/2024-12-24 Outcome-Refining Process Supervision for Code Generation.yaml b/archive/238/2024-12-24+Outcome-Refining+Process+Supervision+for+Code+Generation.yaml
similarity index 100%
rename from current/2024-12-24 Outcome-Refining Process Supervision for Code Generation.yaml
rename to archive/238/2024-12-24+Outcome-Refining+Process+Supervision+for+Code+Generation.yaml
diff --git a/current/2024-12-24 PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World.yaml b/archive/238/2024-12-24+PC+Agent%3A+While+You+Sleep%2C+AI+Works+--+A+Cognitive+Journey+into+Digital+World.yaml
similarity index 100%
rename from current/2024-12-24 PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World.yaml
rename to archive/238/2024-12-24+PC+Agent%3A+While+You+Sleep%2C+AI+Works+--+A+Cognitive+Journey+into+Digital+World.yaml
diff --git a/current/2024-12-24 ResearchTown: Simulator of Human Research Community.yaml b/archive/238/2024-12-24+ResearchTown%3A+Simulator+of+Human+Research+Community.yaml
similarity index 100%
rename from current/2024-12-24 ResearchTown: Simulator of Human Research Community.yaml
rename to archive/238/2024-12-24+ResearchTown%3A+Simulator+of+Human+Research+Community.yaml
diff --git a/tags/ML.md b/tags/ML.md
index e2ac2af7..5769ce58 100644
--- a/tags/ML.md
+++ b/tags/ML.md
@@ -2101,3 +2101,20 @@
 - [Offline Reinforcement Learning for LLM Multi-Step Reasoning](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/237/2024-12-23+Offline+Reinforcement+Learning+for+LLM+Multi-Step+Reasoning.yaml) / 2024-12-23
 - [Sequence Matters: Harnessing Video Models in 3D Super-Resolution](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/237/2024-12-23+Sequence+Matters%3A+Harnessing+Video+Models+in+3D+Super-Resolution.yaml) / 2024-12-23
 - [TRecViT: A Recurrent Video Transformer](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/237/2024-12-23+TRecViT%3A+A+Recurrent+Video+Transformer.yaml) / 2024-12-23
+- [B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-23+B-STaR%3A+Monitoring+and+Balancing+Exploration+and+Exploitation+in+Self-Taught+Reasoners.yaml) / 2024-12-23
+- [Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-23+Distilled+Decoding+1%3A+One-step+Sampling+of+Image+Auto-regressive+Models+with+Flow+Matching.yaml) / 2024-12-23
+- [Diving into Self-Evolving Training for Multimodal Reasoning](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-23+Diving+into+Self-Evolving+Training+for+Multimodal+Reasoning.yaml) / 2024-12-23
+- [NILE: Internal Consistency Alignment in Large Language Models](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-23+NILE%3A+Internal+Consistency+Alignment+in+Large+Language+Models.yaml) / 2024-12-23
+- [Revisiting In-Context Learning with Long Context Language Models](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-23+Revisiting+In-Context+Learning+with+Long+Context+Language+Models.yaml) / 2024-12-23
+- [RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-23+RobustFT%3A+Robust+Supervised+Fine-tuning+for+Large+Language+Models+under+Noisy+Response.yaml) / 2024-12-23
+- [Agent-SafetyBench: Evaluating the Safety of LLM Agents](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+Agent-SafetyBench%3A+Evaluating+the+Safety+of+LLM+Agents.yaml) / 2024-12-24
+- [DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+DRT-o1%3A+Optimized+Deep+Reasoning+Translation+via+Long+Chain-of-Thought.yaml) / 2024-12-24
+- [Deliberation in Latent Space via Differentiable Cache Augmentation](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+Deliberation+in+Latent+Space+via+Differentiable+Cache+Augmentation.yaml) / 2024-12-24
+- [Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+Friends-MMC%3A+A+Dataset+for+Multi-modal+Multi-party+Conversation+Understanding.yaml) / 2024-12-24
+- [Large Motion Video Autoencoding with Cross-modal Video VAE](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+Large+Motion+Video+Autoencoding+with+Cross-modal+Video+VAE.yaml) / 2024-12-24
+- [LearnLM: Improving Gemini for Learning](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+LearnLM%3A+Improving+Gemini+for+Learning.yaml) / 2024-12-24
+- [OpenAI o1 System Card](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+OpenAI+o1+System+Card.yaml) / 2024-12-24
+- [OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+OpenRFT%3A+Adapting+Reasoning+Foundation+Model+for+Domain-specific+Tasks+with+Reinforcement+Fine-Tuning.yaml) / 2024-12-24
+- [Outcome-Refining Process Supervision for Code Generation](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+Outcome-Refining+Process+Supervision+for+Code+Generation.yaml) / 2024-12-24
+- [PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+PC+Agent%3A+While+You+Sleep%2C+AI+Works+--+A+Cognitive+Journey+into+Digital+World.yaml) / 2024-12-24
+- [ResearchTown: Simulator of Human Research Community](https://github.com/deep-diver/hf-daily-paper-newsletter/blob/main/archive/238/2024-12-24+ResearchTown%3A+Simulator+of+Human+Research+Community.yaml) / 2024-12-24