Dmitriev
Danil
·
AI & ML interests
NLP, Multilingual, mLM, Dialog systems, Graph NN
Organizations
None yet
RAG
-
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers
Paper • 2311.09180 • Published • 8 -
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study
Paper • 2409.17580 • Published • 8 -
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
Paper • 2410.08815 • Published • 47
agent
-
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Paper • 2310.11954 • Published • 25 -
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Paper • 2311.05657 • Published • 30 -
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases
Paper • 2407.12784 • Published • 51
Multimodal
Read
-
Instruction Following without Instruction Tuning
Paper • 2409.14254 • Published • 29 -
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Paper • 2409.18943 • Published • 28 -
Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization
Paper • 2410.04717 • Published • 18 -
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Paper • 2410.05193 • Published • 13
follow instructions
-
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Paper • 2411.06208 • Published • 21 -
Instruction Following without Instruction Tuning
Paper • 2409.14254 • Published • 29 -
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation
Paper • 2410.09584 • Published • 48
adapters
music
LLM
-
FlashDecoding++: Faster Large Language Model Inference on GPUs
Paper • 2311.01282 • Published • 37 -
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code
Paper • 2311.07989 • Published • 26 -
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper • 2402.17193 • Published • 26 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140
reasoning
follow instructions
-
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization
Paper • 2411.06208 • Published • 21 -
Instruction Following without Instruction Tuning
Paper • 2409.14254 • Published • 29 -
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation
Paper • 2410.09584 • Published • 48
RAG
-
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers
Paper • 2311.09180 • Published • 8 -
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study
Paper • 2409.17580 • Published • 8 -
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
Paper • 2410.08815 • Published • 47
adapters
agent
-
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
Paper • 2310.11954 • Published • 25 -
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
Paper • 2311.05657 • Published • 30 -
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases
Paper • 2407.12784 • Published • 51
music
Multimodal
LLM
-
FlashDecoding++: Faster Large Language Model Inference on GPUs
Paper • 2311.01282 • Published • 37 -
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code
Paper • 2311.07989 • Published • 26 -
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper • 2402.17193 • Published • 26 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140
Read
-
Instruction Following without Instruction Tuning
Paper • 2409.14254 • Published • 29 -
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
Paper • 2409.18943 • Published • 28 -
Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization
Paper • 2410.04717 • Published • 18 -
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Paper • 2410.05193 • Published • 13