24 39 2047

Rosswill

Kutches

AI & ML interests

Recent Activity

liked a Space about 1 hour ago

DraconicDragon/cl_tagger

liked a Space about 1 hour ago

GeneralGost/wd-tagger-mdf

updated a model about 3 hours ago

Kutches/IL-VAE

View all activity

Organizations

None yet

upvoted a paper 2 days ago

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

Paper • 2508.14460 • Published 3 days ago • 72

upvoted a paper 3 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published 17 days ago • 102

upvoted a paper 5 days ago

XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization

Paper • 2508.10395 • Published 9 days ago • 37

upvoted 3 papers 15 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published 16 days ago • 155

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published 17 days ago • 117

DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning

Paper • 2508.05405 • Published 16 days ago • 62

upvoted a paper 18 days ago

SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension

Paper • 2508.01959 • Published 20 days ago • 56

upvoted an article 19 days ago

Article

Towards Open Evolutionary Agents

and 1 other •

19 days ago

• 14

upvoted an article 20 days ago

Article

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

•

20 days ago

• 6

upvoted a collection 23 days ago

Wan 2.2 GGUFs

Collection

5 items • Updated 4 days ago • 2

upvoted a paper 24 days ago

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Paper • 2507.14111 • Published Jul 18 • 22

upvoted an article 29 days ago

Article

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

and 2 others •

30 days ago

• 79

upvoted 8 papers about 1 month ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published Jul 22 • 118

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 62

Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers

Paper • 2507.08422 • Published Jul 11 • 35

GUI-G^2: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published Jul 21 • 131

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 41

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9 • 45

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 85

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8 • 115

Rosswill

AI & ML interests

Recent Activity

Organizations

Kutches's activity

Towards Open Evolutionary Agents

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨