Data Science | Machine Learning with Python for Researchers
32.5K subscribers
3.11K photos
107 videos
22 files
3.33K links
ads: @HusseinSheikho

The Data Science and Python channel is for researchers and advanced programmers

Buy ads: https://telega.io/c/dataScienceT
Download Telegram
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

📝 Summary:
WebShaper synthesizes information-seeking datasets to address data scarcity for LLM agents. It uses a formalization-driven framework based on set theory and Knowledge Projections, enabling precise control over reasoning structure. This leads to state-of-the-art performance on open-sourced benchma...

🔹 Publication Date: Published on Jul 20

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.15061
• PDF: https://arxiv.org/pdf/2507.15061
• Project Page: https://huggingface.co/papers?q=Knowledge%20Projections%20(KP)
• Github: https://github.com/Alibaba-NLP/WebAgent

🔹 Models citing this paper:
https://huggingface.co/Alibaba-NLP/WebShaper-32B

Datasets citing this paper:
https://huggingface.co/datasets/Alibaba-NLP/WebShaper

==================================

For more data science resources:
https://t.iss.one/DataScienceT

#LLM #AIAgents #DataGeneration #FormalMethods #NLP