CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published 6 days ago • 86
The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? Paper • 2502.17535 • Published Feb 24 • 8
Running • 2.51k • The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters
Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research Paper • 2502.12669 • Published Feb 18 • 2
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing Paper • 2502.04411 • Published Feb 6 • 4
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Paper • 2502.01941 • Published Feb 4 • 15
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference Paper • 2502.00299 • Published Feb 1 • 2
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 91