PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published 15 days ago • 119
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published 14 days ago • 103 • 6
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published Jan 8 • 98
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 14 days ago • 61
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 15 days ago • 80
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference Paper • 2504.05897 • Published 14 days ago • 13
Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence Paper • 2503.20533 • Published 27 days ago • 12
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 14 days ago • 148
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem Paper • 2411.17525 • Published Nov 26, 2024 • 4
Extreme Compression of Large Language Models via Additive Quantization Paper • 2401.06118 • Published Jan 11, 2024 • 13