Arxiv Papers

This paper argues that compensating human labor for training data is the largest cost in developing Large Language Models, significantly exceeding model training expenses, and suggests fairer practices for the future. https://arxiv.org/abs//2504.12427 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:07:16

Position: The Most Expensive Part of an LLM should be its Training Data

Duration:00:20:05

[QA] Activated LoRA: Fine-tuned LLMs for Intrinsics

Activated LoRA (aLoRA) enhances LoRA by adapting weights only for relevant tokens, allowing instant activation without recomputing the KV cache, improving efficiency in multiturn settings. https://arxiv.org/abs//2504.12397 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:08:16

Activated LoRA: Fine-tuned LLMs for Intrinsics

Duration:00:18:55

[QA] COLORBENCH: Can VLMs See and Understand the Colorful World?

The paper presents COLORBENCH, a benchmark to evaluate vision-language models' color understanding, revealing limitations and emphasizing the need for improved color comprehension in multimodal AI. https://arxiv.org/abs//2504.10514 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:07:49

COLORBENCH: Can VLMs See and Understand the Colorful World?

Duration:00:20:40

[QA] ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

https://arxiv.org/abs//2504.11536 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:08:33

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Duration:00:14:57

[QA] Looking beyond the next token

The paper presents TRELAWNEY, a method for rearranging training data to improve causal language models' performance in planning and reasoning without altering architecture, enhancing goal generation capabilities. https://arxiv.org/abs//2504.11336 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:07:22

Looking beyond the next token

Duration:00:16:58

[QA] How to Predict Best Pretraining Data with Small Experiments

The paper introduces DATADECIDE, a suite for evaluating data selection methods, revealing that small-scale model rankings effectively predict larger model performance, enhancing cost-efficient pretraining decisions. https://arxiv.org/abs//2504.11393 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:08:16

How to Predict Best Pretraining Data with Small Experiments

Duration:00:20:22

[QA] Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability

This study evaluates OpenAI's GPT-4o, revealing limitations in semantic synthesis, instruction adherence, and reasoning, challenging assumptions about its multimodal capabilities and calling for improved benchmarks and training strategies. https://arxiv.org/abs//2504.08003 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:07:18

Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability

Duration:00:07:07

[QA] DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

This paper introduces a distribution-level curriculum learning framework for RL-based post-training of LLMs, enhancing reasoning capabilities by adaptively scheduling training across diverse data distributions. https://arxiv.org/abs//2504.09710 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:07:39

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

Duration:00:10:11

[QA] Steering CLIP's vision transformer with sparse autoencoders

This study explores sparse autoencoders in vision models, revealing unique processing patterns and enhancing steerability, leading to improved performance in vision disentanglement tasks and defense strategies. https://arxiv.org/abs//2504.08729 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:08:11

Steering CLIP's vision transformer with sparse autoencoders

Duration:00:17:53

[QA] Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Genius is an unsupervised self-training framework that enhances LLM reasoning without external supervision, using stepwise foresight re-sampling and advantage-calibrated optimization to improve performance. https://arxiv.org/abs//2504.08672 YouTube: https://www.youtube.com/@ArxivPapers TikTok: https://www.tiktok.com/@arxiv_papers Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016 Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

Duration:00:07:58

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning