Piotr Wilkin
ilintar
13 followers · 14 following
ilintar
pwilkin
piotr-wilkin-0011771ba
ilintar
AI & ML interests
None yet
Recent Activity
reacted to onekq's post with 👍 1 day ago
Context rot is a catchy phrase, but the problem was identified 2+ years ago under the name attention decay (https://huggingface.co/papers/2307.03172). I spotted the same problem in coding tasks and documented it in my book (https://www.amazon.com/dp/9999331130). Why did this problem become hot again? Because many of us thought long-context models had solved it, which is not true. We were misled by benchmarks. Most long-context benchmarks are built around the QA scenario, i.e. "finding a needle in a haystack". But in agentic scenarios the model needs to find EVERYTHING in the haystack, and it cannot spare enough attention for that.
updated a model 1 day ago
ilintar/Qwen3-Next-80B-A3B-Instruct-GGUF
new activity 4 days ago
ilintar/Qwen3-Next-80B-A3B-Instruct-GGUF: Fix model name (not A30B, but A3B)
Organizations
models 10
ilintar/Qwen3-Next-80B-A3B-Instruct-GGUF • 80B • Updated 1 day ago • 3.04k • 9
ilintar/NVIDIA-Nemotron-Nano-9B-v2-GGUF • 9B • Updated Aug 29 • 19 • 1
ilintar/Dhanishtha-2.0-preview-0825-Q3_K_M-GGUF • Text Generation • 15B • Updated Aug 2 • 25
ilintar/HelpingAI-Dhanishtha-2.0-preview-0725-GGUF • Updated Jul 20
ilintar/ERNIE-4.5-21B-A3B-PT-gguf • 22B • Updated Jul 15 • 71 • 2
ilintar/Apriel-Nemotron-15b-Thinker-iGGUF • 15B • Updated May 7 • 13
ilintar/THUDM-GLM-4-32B-0414-IQ2_S.GGUF • 33B • Updated Apr 22 • 1 • 1
ilintar/THUDM_GLM-Z1-9B-0414_iGGUF • 9B • Updated Apr 22 • 11 • 3
ilintar/SWE-Dev-7B-iGGUF • 8B • Updated Apr 22 • 3 • 1
ilintar/Llama-3-1-Nemotron-Nano-8B-v1-i-GGUF • Text Generation • 8B • Updated Mar 21
datasets 0
None public yet