DrishtiSharma (Drishti Sharma)

upvoted a collection 5 months ago

SigLIP2

Collection

36 items • Updated Jul 10, 2025 • 104

upvoted an article 6 months ago

Article

Vision Language Models (Better, faster, stronger)

+3

May 12, 2025

•

580

upvoted a collection 6 months ago

Multimodal Benchmarks

Collection

248 items • Updated 5 days ago • 27

upvoted an article 6 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9, 2025

•

751

upvoted a paper 8 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29, 2025 • 72

upvoted an article 9 months ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Oct 20, 2024

•

52

upvoted 3 papers 10 months ago

upvoted an article 11 months ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

+1

Feb 19, 2025

•

74

upvoted 10 papers 11 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18, 2025 • 86

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published Feb 12, 2025 • 20

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16, 2025 • 30

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published Feb 16, 2025 • 23

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published Feb 13, 2025 • 24

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published Feb 13, 2025 • 31

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13, 2025 • 37

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

Paper • 2502.08690 • Published Feb 12, 2025 • 43

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13, 2025 • 148

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10, 2025 • 89

Drishti Sharma

AI & ML interests

Organizations

SigLIP2

Vision Language Models (Better, faster, stronger)

Multimodal Benchmarks

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

The Leaderboard Illusion

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

LLM as a Broken Telephone: Iterative Generation Distorts Information

The Lessons of Developing Process Reward Models in Mathematical Reasoning

START: Self-taught Reasoner with Tools

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Soundwave: Less is More for Speech-Text Alignment in LLMs

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

ReLearn: Unlearning via Learning for Large Language Models

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Logical Reasoning in Large Language Models: A Survey

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Drishti Sharma

AI & ML interests

Organizations

DrishtiSharma's activity

Vision Language Models (Better, faster, stronger)

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

PaliGemma 2 Mix - New Instruction Vision Language Models by Google