π― Want to Upskill in IT? Try Our FREE 2026 Learning Kits!
SPOTO gives you free, instant access to high-quality, updated resources that help you study smarter and pass exams faster.
β Latest Exam Materials:
Covering #Python, #Cisco, #PMI, #Fortinet, #AWS, #Azure, #AI, #Excel, #comptia, #ITIL, #cloud & more!
β 100% Free, No Sign-up:
All materials are instantly downloadable
β Whatβs Inside:
γ»πIT Certs E-book: https://bit.ly/3Mlu5ez
γ»πIT Exams Skill Test: https://bit.ly/3NVrgRU
γ»πFree IT courses: https://bit.ly/3M9h5su
γ»π€Free PMP Study Guide: https://bit.ly/4te3EIn
γ»βοΈFree Cloud Study Guide: https://bit.ly/4kgFVDs
π Become Part of Our IT Learning Circle! resources and support:
https://chat.whatsapp.com/FlG2rOYVySLEHLKXF3nKGB
π¬ Want exam help? Chat with an admin now!
wa.link/8fy3x4
SPOTO gives you free, instant access to high-quality, updated resources that help you study smarter and pass exams faster.
β Latest Exam Materials:
Covering #Python, #Cisco, #PMI, #Fortinet, #AWS, #Azure, #AI, #Excel, #comptia, #ITIL, #cloud & more!
β 100% Free, No Sign-up:
All materials are instantly downloadable
β Whatβs Inside:
γ»πIT Certs E-book: https://bit.ly/3Mlu5ez
γ»πIT Exams Skill Test: https://bit.ly/3NVrgRU
γ»πFree IT courses: https://bit.ly/3M9h5su
γ»π€Free PMP Study Guide: https://bit.ly/4te3EIn
γ»βοΈFree Cloud Study Guide: https://bit.ly/4kgFVDs
π Become Part of Our IT Learning Circle! resources and support:
https://chat.whatsapp.com/FlG2rOYVySLEHLKXF3nKGB
π¬ Want exam help? Chat with an admin now!
wa.link/8fy3x4
β¨Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning
π Summary:
DAIL improves LLM reasoning by converting didactic expert solutions into detailed, in-distribution traces via contrastive learning. This method achieves 10-25% performance gains and 2-4x reasoning efficiency using minimal expert data.
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02405
β’ PDF: https://arxiv.org/pdf/2602.02405
β’ Github: https://github.com/ethanm88/DAIL
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/emendes3/e1-proof
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
DAIL improves LLM reasoning by converting didactic expert solutions into detailed, in-distribution traces via contrastive learning. This method achieves 10-25% performance gains and 2-4x reasoning efficiency using minimal expert data.
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02405
β’ PDF: https://arxiv.org/pdf/2602.02405
β’ Github: https://github.com/ethanm88/DAIL
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/emendes3/e1-proof
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨Feedback by Design: Understanding and Overcoming User Feedback Barriers in Conversational Agents
π Summary:
High-quality feedback is essential for effective human-AI interaction. It bridges knowledge gaps, corrects digressions, and shapes system behavior; both during interaction and throughout model develop...
πΉ Publication Date: Published on Feb 1
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.01405
β’ PDF: https://arxiv.org/pdf/2602.01405
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
High-quality feedback is essential for effective human-AI interaction. It bridges knowledge gaps, corrects digressions, and shapes system behavior; both during interaction and throughout model develop...
πΉ Publication Date: Published on Feb 1
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.01405
β’ PDF: https://arxiv.org/pdf/2602.01405
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨Scaling Small Agents Through Strategy Auctions
π Summary:
Small language models fail on complex tasks. The paper proposes Strategy Auctions for Workload Efficiency SALE, a marketplace-inspired framework where agents bid strategic plans for task routing and self-improvement. SALE reduces costs by 35% and improves performance, enabling small agents to sca...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02751
β’ PDF: https://arxiv.org/pdf/2602.02751
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Small language models fail on complex tasks. The paper proposes Strategy Auctions for Workload Efficiency SALE, a marketplace-inspired framework where agents bid strategic plans for task routing and self-improvement. SALE reduces costs by 35% and improves performance, enabling small agents to sca...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02751
β’ PDF: https://arxiv.org/pdf/2602.02751
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers
π Summary:
MemoryLLM decouples feed-forward networks from self-attention in transformers, enabling context-free token-wise neural retrieval memory that improves inference efficiency through pre-computed lookups....
πΉ Publication Date: Published on Jan 30
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.00398
β’ PDF: https://arxiv.org/pdf/2602.00398
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
MemoryLLM decouples feed-forward networks from self-attention in transformers, enabling context-free token-wise neural retrieval memory that improves inference efficiency through pre-computed lookups....
πΉ Publication Date: Published on Jan 30
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.00398
β’ PDF: https://arxiv.org/pdf/2602.00398
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨Position: Agentic Evolution is the Path to Evolving LLMs
π Summary:
Large language models struggle to adapt to changing real-world environments. Agentic evolution is proposed as a new approach where deployment-time improvement becomes a goal-directed optimization process. This allows for sustained, open-ended adaptation by scaling evolution.
πΉ Publication Date: Published on Jan 30
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.00359
β’ PDF: https://arxiv.org/pdf/2602.00359
β’ Github: https://github.com/ventr1c/agentic-evoluiton
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Large language models struggle to adapt to changing real-world environments. Agentic evolution is proposed as a new approach where deployment-time improvement becomes a goal-directed optimization process. This allows for sustained, open-ended adaptation by scaling evolution.
πΉ Publication Date: Published on Jan 30
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.00359
β’ PDF: https://arxiv.org/pdf/2602.00359
β’ Github: https://github.com/ventr1c/agentic-evoluiton
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β€1
β¨MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
π Summary:
MiniCPM-V 4.5 is an 8B multimodal LLM achieving high performance and efficiency. It uses a unified 3D-Resampler, unified learning, and hybrid reinforcement learning. It surpasses larger models like GPT-4o and Qwen2.5-VL with significantly less memory and faster inference.
πΉ Publication Date: Published on Sep 16, 2025
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2509.18154
β’ PDF: https://arxiv.org/pdf/2509.18154
β’ Github: https://github.com/OpenBMB/MiniCPM-V
πΉ Models citing this paper:
β’ https://huggingface.co/openbmb/MiniCPM-V-4_5
β’ https://huggingface.co/openbmb/MiniCPM-V-4_5-gguf
β’ https://huggingface.co/openbmb/MiniCPM-V-4_5-AWQ
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset
β¨ Spaces citing this paper:
β’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5-int4-CPU-0
β’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5-from_gpt5
β’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
MiniCPM-V 4.5 is an 8B multimodal LLM achieving high performance and efficiency. It uses a unified 3D-Resampler, unified learning, and hybrid reinforcement learning. It surpasses larger models like GPT-4o and Qwen2.5-VL with significantly less memory and faster inference.
πΉ Publication Date: Published on Sep 16, 2025
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2509.18154
β’ PDF: https://arxiv.org/pdf/2509.18154
β’ Github: https://github.com/OpenBMB/MiniCPM-V
πΉ Models citing this paper:
β’ https://huggingface.co/openbmb/MiniCPM-V-4_5
β’ https://huggingface.co/openbmb/MiniCPM-V-4_5-gguf
β’ https://huggingface.co/openbmb/MiniCPM-V-4_5-AWQ
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/openbmb/RLAIF-V-Dataset
β¨ Spaces citing this paper:
β’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5-int4-CPU-0
β’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5-from_gpt5
β’ https://huggingface.co/spaces/CGQN/MiniCPM-V-4_5
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and...
Multimodal Large Language Models (MLLMs) are undergoing rapid progress and represent the frontier of AI development. However, their training and inference efficiency have emerged as a core...
β¨Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch
π Summary:
Privasis is a new million-scale synthetic dataset for AI privacy research. It addresses data scarcity, enabling compact sanitization models that outperform large language models like GPT-5. The diverse dataset and models will be released to the public.
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.03183
β’ PDF: https://arxiv.org/pdf/2602.03183
β’ Project Page: https://privasis.github.io
β’ Github: https://github.com/skywalker023/privasis
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Privasis is a new million-scale synthetic dataset for AI privacy research. It addresses data scarcity, enabling compact sanitization models that outperform large language models like GPT-5. The diverse dataset and models will be released to the public.
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.03183
β’ PDF: https://arxiv.org/pdf/2602.03183
β’ Project Page: https://privasis.github.io
β’ Github: https://github.com/skywalker023/privasis
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨FIRE-Bench: Evaluating Agents on the Rediscovery of Scientific Insights
π Summary:
FIRE-Bench evaluates AI agents on rediscovering scientific findings through full research cycles, from hypothesis to conclusions. Agents receive a high-level question and act autonomously. Current agents struggle, showing that reliable AI-driven scientific discovery remains challenging.
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02905
β’ PDF: https://arxiv.org/pdf/2602.02905
β’ Project Page: https://firebench.github.io/
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
FIRE-Bench evaluates AI agents on rediscovering scientific findings through full research cycles, from hypothesis to conclusions. Agents receive a high-level question and act autonomously. Current agents struggle, showing that reliable AI-driven scientific discovery remains challenging.
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02905
β’ PDF: https://arxiv.org/pdf/2602.02905
β’ Project Page: https://firebench.github.io/
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
π Summary:
UI-TARS-2 is a native GUI agent model that tackles challenges in data scalability and multi-turn reinforcement learning. It significantly improves over its predecessor and strong baselines on GUI and game benchmarks, demonstrating robust generalization. This advances GUI agents for real-world int...
πΉ Publication Date: Published on Sep 2, 2025
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2509.02544
β’ PDF: https://arxiv.org/pdf/2509.02544
β’ Project Page: https://seed-tars.com/showcase/ui-tars-2/
β’ Github: https://github.com/bytedance/ui-tars
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
UI-TARS-2 is a native GUI agent model that tackles challenges in data scalability and multi-turn reinforcement learning. It significantly improves over its predecessor and strong baselines on GUI and game benchmarks, demonstrating robust generalization. This advances GUI agents for real-world int...
πΉ Publication Date: Published on Sep 2, 2025
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2509.02544
β’ PDF: https://arxiv.org/pdf/2509.02544
β’ Project Page: https://seed-tars.com/showcase/ui-tars-2/
β’ Github: https://github.com/bytedance/ui-tars
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
β¨SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation
π Summary:
SoMA is a 3D Gaussian Splat simulator that enables stable, long-horizon manipulation of soft bodies by coupling deformable dynamics, environmental forces, and robot actions in a unified latent neural ...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02402
β’ PDF: https://arxiv.org/pdf/2602.02402
β’ Project Page: https://huggingface.co/collections/SuemH/project-page
β’ Github: https://city-super.github.io/SoMA/
πΉ Models citing this paper:
β’ https://huggingface.co/SuemH/SoMA
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
SoMA is a 3D Gaussian Splat simulator that enables stable, long-horizon manipulation of soft bodies by coupling deformable dynamics, environmental forces, and robot actions in a unified latent neural ...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02402
β’ PDF: https://arxiv.org/pdf/2602.02402
β’ Project Page: https://huggingface.co/collections/SuemH/project-page
β’ Github: https://city-super.github.io/SoMA/
πΉ Models citing this paper:
β’ https://huggingface.co/SuemH/SoMA
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
π Summary:
Agentic RAG framework enables models to dynamically adapt retrieval decisions across multiple granularities, outperforming traditional approaches while scaling efficiently with model improvements. AI-...
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.03442
β’ PDF: https://arxiv.org/pdf/2602.03442
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Agentic RAG framework enables models to dynamically adapt retrieval decisions across multiple granularities, outperforming traditional approaches while scaling efficiently with model improvements. AI-...
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.03442
β’ PDF: https://arxiv.org/pdf/2602.03442
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨Self-Hinting Language Models Enhance Reinforcement Learning
π Summary:
SAGE is an on-policy reinforcement learning framework that enhances GRPO by injecting self-hints during training to increase outcome diversity under sparse rewards, improving alignment of large langua...
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.03143
β’ PDF: https://arxiv.org/pdf/2602.03143
β’ Github: https://github.com/BaohaoLiao/SAGE
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
SAGE is an on-policy reinforcement learning framework that enhances GRPO by injecting self-hints during training to increase outcome diversity under sparse rewards, improving alignment of large langua...
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.03143
β’ PDF: https://arxiv.org/pdf/2602.03143
β’ Github: https://github.com/BaohaoLiao/SAGE
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨Context Learning for Multi-Agent Discussion
π Summary:
Multi-Agent Discussion methods suffer from inconsistency due to individual context misalignment, which is addressed through a context learning approach that dynamically generates context instructions ...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02350
β’ PDF: https://arxiv.org/pdf/2602.02350
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Multi-Agent Discussion methods suffer from inconsistency due to individual context misalignment, which is addressed through a context learning approach that dynamically generates context instructions ...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02350
β’ PDF: https://arxiv.org/pdf/2602.02350
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨A2Eval: Agentic and Automated Evaluation for Embodied Brain
π Summary:
Agentic automatic evaluation framework automates embodied vision-language model assessment through collaborative agents that reduce evaluation costs and improve ranking accuracy. AI-generated summary ...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.01640
β’ PDF: https://arxiv.org/pdf/2602.01640
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Agentic automatic evaluation framework automates embodied vision-language model assessment through collaborative agents that reduce evaluation costs and improve ranking accuracy. AI-generated summary ...
πΉ Publication Date: Published on Feb 2
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.01640
β’ PDF: https://arxiv.org/pdf/2602.01640
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨UI-TARS: Pioneering Automated GUI Interaction with Native Agents
π Summary:
UI-TARS, a native GUI agent model using screenshots as input, outperforms commercial models in various benchmarks through enhanced perception, unified action modeling, system-2 reasoning, and iterativ...
πΉ Publication Date: Published on Jan 21, 2025
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2501.12326
β’ PDF: https://arxiv.org/pdf/2501.12326
β’ Github: https://github.com/bytedance/UI-TARS
πΉ Models citing this paper:
β’ https://huggingface.co/ByteDance-Seed/UI-TARS-1.5-7B
β’ https://huggingface.co/ByteDance-Seed/UI-TARS-7B-DPO
β’ https://huggingface.co/ByteDance-Seed/UI-TARS-7B-SFT
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/Hcompany/WebClick
β¨ Spaces citing this paper:
β’ https://huggingface.co/spaces/omar0scarf/ui-tars-api
β’ https://huggingface.co/spaces/bytedance-research/UI-TARS
β’ https://huggingface.co/spaces/Aheader/gui_test_app
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
UI-TARS, a native GUI agent model using screenshots as input, outperforms commercial models in various benchmarks through enhanced perception, unified action modeling, system-2 reasoning, and iterativ...
πΉ Publication Date: Published on Jan 21, 2025
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2501.12326
β’ PDF: https://arxiv.org/pdf/2501.12326
β’ Github: https://github.com/bytedance/UI-TARS
πΉ Models citing this paper:
β’ https://huggingface.co/ByteDance-Seed/UI-TARS-1.5-7B
β’ https://huggingface.co/ByteDance-Seed/UI-TARS-7B-DPO
β’ https://huggingface.co/ByteDance-Seed/UI-TARS-7B-SFT
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/Hcompany/WebClick
β¨ Spaces citing this paper:
β’ https://huggingface.co/spaces/omar0scarf/ui-tars-api
β’ https://huggingface.co/spaces/bytedance-research/UI-TARS
β’ https://huggingface.co/spaces/Aheader/gui_test_app
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
UI-TARS: Pioneering Automated GUI Interaction with Native Agents
This paper introduces UI-TARS, a native GUI agent model that solely perceives the screenshots as input and performs human-like interactions (e.g., keyboard and mouse operations). Unlike prevailing...
β¨Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
π Summary:
Quant VideoGen addresses KV cache memory limitations in autoregressive video diffusion models through semantic-aware smoothing and progressive residual quantization, achieving significant memory reduc...
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02958
β’ PDF: https://arxiv.org/pdf/2602.02958
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Quant VideoGen addresses KV cache memory limitations in autoregressive video diffusion models through semantic-aware smoothing and progressive residual quantization, achieving significant memory reduc...
πΉ Publication Date: Published on Feb 3
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.02958
β’ PDF: https://arxiv.org/pdf/2602.02958
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models
π Summary:
EgoActor is a unified vision-language model that translates high-level instructions into precise humanoid robot actions through integrated perception and execution across simulated and real-world envi...
πΉ Publication Date: Published on Feb 4
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.04515
β’ PDF: https://arxiv.org/pdf/2602.04515
β’ Github: https://baai-agents.github.io/EgoActor/
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
EgoActor is a unified vision-language model that translates high-level instructions into precise humanoid robot actions through integrated perception and execution across simulated and real-world envi...
πΉ Publication Date: Published on Feb 4
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2602.04515
β’ PDF: https://arxiv.org/pdf/2602.04515
β’ Github: https://baai-agents.github.io/EgoActor/
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
β¨PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR
π Summary:
Search agents trained on scientific paper corpora demonstrate advanced reasoning capabilities for technical question-answering tasks, outperforming traditional retrieval methods through reinforcement ...
πΉ Publication Date: Published on Jan 26
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2601.18207
β’ PDF: https://arxiv.org/pdf/2601.18207
β’ Project Page: https://jmhb0.github.io/PaperSearchQA/
β’ Github: https://jmhb0.github.io/PaperSearchQA/
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/jmhb/PaperSearchQA
β’ https://huggingface.co/datasets/jmhb/pubmed_bioasq_2022
β’ https://huggingface.co/datasets/jmhb/bioasq_factoid
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
π Summary:
Search agents trained on scientific paper corpora demonstrate advanced reasoning capabilities for technical question-answering tasks, outperforming traditional retrieval methods through reinforcement ...
πΉ Publication Date: Published on Jan 26
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2601.18207
β’ PDF: https://arxiv.org/pdf/2601.18207
β’ Project Page: https://jmhb0.github.io/PaperSearchQA/
β’ Github: https://jmhb0.github.io/PaperSearchQA/
β¨ Datasets citing this paper:
β’ https://huggingface.co/datasets/jmhb/PaperSearchQA
β’ https://huggingface.co/datasets/jmhb/pubmed_bioasq_2022
β’ https://huggingface.co/datasets/jmhb/bioasq_factoid
==================================
For more data science resources:
β https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research