ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

📝 Summary:
WebWeaver is a dual-agent framework for open-ended deep research. It combines dynamic planning, which interleaves evidence acquisition with outline optimization, and hierarchical, targeted writing to overcome long-context issues. This approach produces state-of-the-art, high-quality, reliab...
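
A minimal sketch of the plan-then-write idea described above, not the authors' implementation. Everything here is an assumption for illustration: `toy_search` stands in for a real web search tool, the outline is a plain mapping from section title to evidence ids, and "targeted writing" is read as each section seeing only the evidence attached to it.

```python
from dataclasses import dataclass, field


@dataclass
class Outline:
    # Section title -> ids of the evidence snippets that support it.
    sections: dict[str, list[int]] = field(default_factory=dict)


def toy_search(query: str) -> list[str]:
    """Hypothetical stand-in for a web search tool."""
    corpus = {
        "solar": ["Solar cost fell over the decade.", "Solar needs storage."],
        "wind": ["Offshore wind capacity is growing."],
    }
    return corpus.get(query, [])


def plan(queries: list[str]) -> tuple[Outline, list[str]]:
    """Interleave evidence acquisition with outline updates."""
    outline, memory = Outline(), []
    for query in queries:
        for snippet in toy_search(query):  # acquire evidence
            memory.append(snippet)
            # Refine the outline: attach the new evidence id to a section.
            outline.sections.setdefault(query.title(), []).append(len(memory) - 1)
    return outline, memory


def write(outline: Outline, memory: list[str]) -> str:
    """Write one section at a time, reading only that section's evidence."""
    parts = []
    for title, ids in outline.sections.items():
        cited = " ".join(memory[i] for i in ids)
        parts.append(f"{title}: {cited}")
    return "\n".join(parts)


outline, memory = plan(["solar", "wind"])
report = write(outline, memory)
```

Because the writer looks up evidence by id per section, it never needs the whole evidence bank in context at once, which is the long-context point the summary makes.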

🔹 Publication Date: Published on Sep 16

🔹 Paper Links:
• Explainer Page: https://arxivexplained.com/papers/webweaver-structuring-web-scale-evidence-with-dynamic-outlines-for-open-ended-deep-research
• PDF: https://arxiv.org/pdf/2509.13312
• Project Page: https://tongyi-agent.github.io/blog/
• Github: https://tongyi-agent.github.io/blog/

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#AI #Research #AgentSystems #LLM #KnowledgeManagement

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

📝 Summary:
DocETL is an agent-based system that optimizes complex document processing pipelines to improve the accuracy of LLM outputs. It uses logical rewriting and agent-guided evaluation to produce outputs 1.34 to 4.6 times higher in quality than current baselines.
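
To make "logical rewriting" concrete, here is a toy sketch in plain Python. It does not use DocETL's API. The function names are hypothetical, and `toy_llm_count` is an assumed stand-in for an LLM call. The idea shown is one kind of rewrite: a single operation over a long document is replaced by a split step, a per-chunk map, and a reduce.

```python
def toy_llm_count(text: str, keyword: str) -> int:
    """Hypothetical stand-in for an LLM call that counts mentions in text."""
    return text.lower().count(keyword)


def original_plan(document: str, keyword: str) -> int:
    # One map operation over the whole document.
    return toy_llm_count(document, keyword)


def rewritten_plan(document: str, keyword: str, chunk_words: int = 50) -> int:
    # Logical rewrite: split -> map per chunk -> reduce.
    words = document.split()
    chunks = [
        " ".join(words[i:i + chunk_words])
        for i in range(0, len(words), chunk_words)
    ]
    partials = [toy_llm_count(chunk, keyword) for chunk in chunks]  # map
    return sum(partials)  # reduce


document = "error ok " * 120
before = original_plan(document, "error")
after = rewritten_plan(document, "error")
```

In this toy the two plans agree exactly because the stand-in is deterministic. With a real LLM the outputs of candidate plans would differ, which is why an evaluation step is needed to choose among them.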

🔹 Publication Date: Published on Oct 16, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2410.12189
• PDF: https://arxiv.org/pdf/2410.12189
• Github: https://github.com/ucbepic/docetl

#LLM #AI #DocumentProcessing #AgentSystems #NaturalLanguageProcessing