wangshuai

wangsssssss

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

new activity 4 days ago

MCG-NJU/DMM: Add library name and license

authored a paper 5 days ago

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

wangsssssss's activity

upvoted a paper 2 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 4 days ago • 87

upvoted 4 papers 5 days ago

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

Paper • 2504.12364 • Published 7 days ago • 18

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Paper • 2504.13122 • Published 5 days ago • 21

Antidistillation Sampling

Paper • 2504.13146 • Published 5 days ago • 59

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published 6 days ago • 45

upvoted 2 papers 7 days ago

Efficient Generative Model Training via Embedded Representation Warmup

Paper • 2504.10188 • Published 9 days ago • 12

Seedream 3.0 Technical Report

Paper • 2504.11346 • Published 8 days ago • 42

upvoted 2 papers 9 days ago

PixelFlow: Pixel-Space Generative Models with Flow

Paper • 2504.07963 • Published 12 days ago • 19

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 12 days ago • 120

upvoted a paper 12 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 13 days ago • 118

upvoted a paper 13 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 15 days ago • 73

upvoted a paper about 1 month ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published Mar 10 • 35

upvoted a paper 4 months ago

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published Jan 2 • 42

upvoted a paper 5 months ago

FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution

Paper • 2410.22655 • Published Oct 30, 2024 • 1

upvoted a paper 8 months ago

CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published Aug 29, 2024 • 58