Developers are deploying malware-based tarpits like Nepenthes and Iocaine to trap and poison AI web crawlers that ignore robots.txt. These tools waste resources, mislead AI models with gibberish, and inflate AI companies' data collection costs.
⚙️ The Details
→ Nepenthes, created by an anonymous developer, traps AI crawlers in an infinite maze of static files, preventing them from reaching real content and feeding them Markov babble to poison AI training data.
→ The tarpitting technique, originally used against email spammers, is now weaponized against AI crawlers accused of violating robots.txt rules.
→ OpenAI is the only major AI company whose crawler has evaded Nepenthes, while others struggle to bypass it.
→ Iocaine, inspired by Nepenthes, goes further by intentionally poisoning AI datasets using a reverse proxy to serve misleading data.
→ Critics argue that tarpits could be ineffective long-term, as AI companies improve data filtering, but supporters see them as a form of resistance against AI exploitation.
→ AI firms continue to develop poisoning countermeasures, but the growing cost and complexity of AI training may be impacted by widespread adoption of such tactics.
Vuelvo a aclarar:
El unico proposito de este twit es mostrar que no era facil para una persona comun comprar este instrumento o memecoin.
No es Endorsement, tutorial o recomendacion.
La descripcion es al solo efecto de ver lo dificil que es comprar para un inversor normal, que no pertenece al ecosistema.
No confundan a la gente.