✨FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning
📝 Summary:
FABLE is a new retrieval framework enhancing LLM-based multi-document reasoning through hierarchical forest indexes and a bi-path strategy. It outperforms traditional RAG with up to 94 percent token reduction, proving the ongoing need for structured retrieval.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18116
• PDF: https://arxiv.org/pdf/2601.18116
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#LLM #InformationRetrieval #MultiDocumentReasoning #RAG #NLP
📝 Summary:
FABLE is a new retrieval framework enhancing LLM-based multi-document reasoning through hierarchical forest indexes and a bi-path strategy. It outperforms traditional RAG with up to 94 percent token reduction, proving the ongoing need for structured retrieval.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18116
• PDF: https://arxiv.org/pdf/2601.18116
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#LLM #InformationRetrieval #MultiDocumentReasoning #RAG #NLP
❤2
✨HalluCitation Matters: Revealing the Impact of Hallucinated References with 300 Hallucinated Papers in ACL Conferences
📝 Summary:
Hallucinated citations HalluCitation are a growing problem in NLP papers. This study found nearly 300 papers from 2024-2025 contain HalluCitations, with a rapid increase at EMNLP 2025, threatening scientific reliability and conference credibility.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18724
• PDF: https://arxiv.org/pdf/2601.18724
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#HalluCitation #NLP #ResearchIntegrity #AI #AcademicPublishing
📝 Summary:
Hallucinated citations HalluCitation are a growing problem in NLP papers. This study found nearly 300 papers from 2024-2025 contain HalluCitations, with a rapid increase at EMNLP 2025, threatening scientific reliability and conference credibility.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18724
• PDF: https://arxiv.org/pdf/2601.18724
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#HalluCitation #NLP #ResearchIntegrity #AI #AcademicPublishing
❤1👍1
✨Benchmarks Saturate When The Model Gets Smarter Than The Judge
📝 Summary:
This paper introduces Omni-MATH-2, a manually audited mathematical benchmark dataset to reduce noise. It reveals that existing judges like Omni-Judge are highly inaccurate, masking real model performance differences. Accurate benchmarks require both high-quality datasets and more competent judges.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19532
• PDF: https://arxiv.org/pdf/2601.19532
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #MachineLearning #Benchmarking #ModelEvaluation #Datasets
📝 Summary:
This paper introduces Omni-MATH-2, a manually audited mathematical benchmark dataset to reduce noise. It reveals that existing judges like Omni-Judge are highly inaccurate, masking real model performance differences. Accurate benchmarks require both high-quality datasets and more competent judges.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19532
• PDF: https://arxiv.org/pdf/2601.19532
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #MachineLearning #Benchmarking #ModelEvaluation #Datasets
❤1
✨Post-LayerNorm Is Back: Stable, ExpressivE, and Deep
📝 Summary:
Keel is a novel Post-LayerNorm Transformer using Highway-style connections instead of residual ones. This enables stable training of networks over 1000 layers deep, preventing gradient vanishing and improving expressivity for LLMs.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19895
• PDF: https://arxiv.org/pdf/2601.19895
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#Transformers #DeepLearning #LLM #NeuralNetworks #AIResearch
📝 Summary:
Keel is a novel Post-LayerNorm Transformer using Highway-style connections instead of residual ones. This enables stable training of networks over 1000 layers deep, preventing gradient vanishing and improving expressivity for LLMs.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19895
• PDF: https://arxiv.org/pdf/2601.19895
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#Transformers #DeepLearning #LLM #NeuralNetworks #AIResearch
❤1
✨EvolVE: Evolutionary Search for LLM-based Verilog Generation and Optimization
📝 Summary:
EvolVE improves LLM-based Verilog generation and optimization through evolutionary search. It uses MCTS for correctness and IGR for optimization, accelerated by STG. EvolVE achieves state-of-the-art performance and reduces PPA on industry-scale designs.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18067
• PDF: https://arxiv.org/pdf/2601.18067
• Github: https://github.com/weiber2002/ICRTL
✨ Datasets citing this paper:
• https://huggingface.co/datasets/weiber2002/ICRTL
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#LLM #Verilog #EvolutionaryAlgorithms #HardwareDesign #AI
📝 Summary:
EvolVE improves LLM-based Verilog generation and optimization through evolutionary search. It uses MCTS for correctness and IGR for optimization, accelerated by STG. EvolVE achieves state-of-the-art performance and reduces PPA on industry-scale designs.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18067
• PDF: https://arxiv.org/pdf/2601.18067
• Github: https://github.com/weiber2002/ICRTL
✨ Datasets citing this paper:
• https://huggingface.co/datasets/weiber2002/ICRTL
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#LLM #Verilog #EvolutionaryAlgorithms #HardwareDesign #AI
❤1
❗️LISA HELPS EVERYONE EARN MONEY!$29,000 HE'S GIVING AWAY TODAY!
Everyone can join his channel and make money! He gives away from $200 to $5.000 every day in his channel
https://t.iss.one/+HDFF3Mo_t68zNWQy
⚡️FREE ONLY FOR THE FIRST 500 SUBSCRIBERS! FURTHER ENTRY IS PAID! 👆👇
https://t.iss.one/+HDFF3Mo_t68zNWQy
Everyone can join his channel and make money! He gives away from $200 to $5.000 every day in his channel
https://t.iss.one/+HDFF3Mo_t68zNWQy
⚡️FREE ONLY FOR THE FIRST 500 SUBSCRIBERS! FURTHER ENTRY IS PAID! 👆👇
https://t.iss.one/+HDFF3Mo_t68zNWQy
✨DeFM: Learning Foundation Representations from Depth for Robotics
📝 Summary:
DeFM is a self-supervised foundation model for depth representation learning in robotics. It learns geometric and semantic features from 60M depth images, achieving state-of-the-art performance across diverse robotic tasks and strong sim-to-real generalization.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18923
• PDF: https://arxiv.org/pdf/2601.18923
• Github: https://de-fm.github.io/
🔹 Models citing this paper:
• https://huggingface.co/leggedrobotics/defm
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#Robotics #FoundationModels #SelfSupervisedLearning #ComputerVision #MachineLearning
📝 Summary:
DeFM is a self-supervised foundation model for depth representation learning in robotics. It learns geometric and semantic features from 60M depth images, achieving state-of-the-art performance across diverse robotic tasks and strong sim-to-real generalization.
🔹 Publication Date: Published on Jan 26
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.18923
• PDF: https://arxiv.org/pdf/2601.18923
• Github: https://de-fm.github.io/
🔹 Models citing this paper:
• https://huggingface.co/leggedrobotics/defm
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#Robotics #FoundationModels #SelfSupervisedLearning #ComputerVision #MachineLearning
❤1
✨HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models
📝 Summary:
HyperAlign uses a hypernetwork to efficiently align diffusion models at test-time. It dynamically adjusts denoising trajectories based on input conditions, improving semantic consistency and visual appeal. This outperforms existing methods.
🔹 Publication Date: Published on Jan 22
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15968
• PDF: https://arxiv.org/pdf/2601.15968
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#DiffusionModels #Hypernetworks #GenerativeAI #AIResearch #DeepLearning
📝 Summary:
HyperAlign uses a hypernetwork to efficiently align diffusion models at test-time. It dynamically adjusts denoising trajectories based on input conditions, improving semantic consistency and visual appeal. This outperforms existing methods.
🔹 Publication Date: Published on Jan 22
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.15968
• PDF: https://arxiv.org/pdf/2601.15968
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#DiffusionModels #Hypernetworks #GenerativeAI #AIResearch #DeepLearning
❤2
✨Towards Pixel-Level VLM Perception via Simple Points Prediction
📝 Summary:
SimpleSeg enables MLLMs to perform pixel-level segmentation by predicting point sequences in language space. A two-stage training with reinforcement learning refines these points. This simple method achieves competitive results, showing MLLMs have inherent low-level perception without specialized...
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19228
• PDF: https://arxiv.org/pdf/2601.19228
• Project Page: https://simpleseg.github.io/
• Github: https://github.com/songtianhui/SimpleSeg
🔹 Models citing this paper:
• https://huggingface.co/sthui/SimpleSeg-Kimi-VL
• https://huggingface.co/sthui/SimpleSeg-Qwen2.5-VL
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#VLM #MLLM #ImageSegmentation #DeepLearning #AIResearch
📝 Summary:
SimpleSeg enables MLLMs to perform pixel-level segmentation by predicting point sequences in language space. A two-stage training with reinforcement learning refines these points. This simple method achieves competitive results, showing MLLMs have inherent low-level perception without specialized...
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19228
• PDF: https://arxiv.org/pdf/2601.19228
• Project Page: https://simpleseg.github.io/
• Github: https://github.com/songtianhui/SimpleSeg
🔹 Models citing this paper:
• https://huggingface.co/sthui/SimpleSeg-Kimi-VL
• https://huggingface.co/sthui/SimpleSeg-Qwen2.5-VL
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#VLM #MLLM #ImageSegmentation #DeepLearning #AIResearch
❤1
✨Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
📝 Summary:
Youtu-VL introduces a Vision-Language Unified Autoregressive Supervision paradigm. It shifts from vision-as-input to vision-as-target, integrating visual tokens into the prediction stream. This improves multimodal comprehension and vision-centric task performance, fostering generalist visual agents.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19798
• PDF: https://arxiv.org/pdf/2601.19798
• Project Page: https://youtu-tip.com/#llm
• Github: https://github.com/TencentCloudADP/youtu-vl
🔹 Models citing this paper:
• https://huggingface.co/tencent/Youtu-VL-4B-Instruct
• https://huggingface.co/tencent/Youtu-VL-4B-Instruct-GGUF
• https://huggingface.co/tencent/Youtu-Parsing
✨ Spaces citing this paper:
• https://huggingface.co/spaces/tencent/Youtu-Parsing
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#YoutuVL #VisionLanguage #MultimodalAI #ComputerVision #DeepLearning
📝 Summary:
Youtu-VL introduces a Vision-Language Unified Autoregressive Supervision paradigm. It shifts from vision-as-input to vision-as-target, integrating visual tokens into the prediction stream. This improves multimodal comprehension and vision-centric task performance, fostering generalist visual agents.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19798
• PDF: https://arxiv.org/pdf/2601.19798
• Project Page: https://youtu-tip.com/#llm
• Github: https://github.com/TencentCloudADP/youtu-vl
🔹 Models citing this paper:
• https://huggingface.co/tencent/Youtu-VL-4B-Instruct
• https://huggingface.co/tencent/Youtu-VL-4B-Instruct-GGUF
• https://huggingface.co/tencent/Youtu-Parsing
✨ Spaces citing this paper:
• https://huggingface.co/spaces/tencent/Youtu-Parsing
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#YoutuVL #VisionLanguage #MultimodalAI #ComputerVision #DeepLearning
arXiv.org
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language...
Despite the significant advancements represented by Vision-Language Models (VLMs), current architectures often exhibit limitations in retaining fine-grained visual information, leading to...
✨CooperBench: Why Coding Agents Cannot be Your Teammates Yet
📝 Summary:
AI agents lack social intelligence for teamwork. CooperBench, a new collaborative coding benchmark, shows agents perform 30% worse together than individually. This 'curse of coordination' is due to poor communication, broken commitments, and incorrect expectations, calling for AI to develop socia...
🔹 Publication Date: Published on Jan 19
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13295
• PDF: https://arxiv.org/pdf/2601.13295
• Project Page: https://cooperbench.com
• Github: https://github.com/cooperbench/CooperBench
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
AI agents lack social intelligence for teamwork. CooperBench, a new collaborative coding benchmark, shows agents perform 30% worse together than individually. This 'curse of coordination' is due to poor communication, broken commitments, and incorrect expectations, calling for AI to develop socia...
🔹 Publication Date: Published on Jan 19
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.13295
• PDF: https://arxiv.org/pdf/2601.13295
• Project Page: https://cooperbench.com
• Github: https://github.com/cooperbench/CooperBench
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
❤1
✨Self-Distillation Enables Continual Learning
📝 Summary:
Self-Distillation Fine-Tuning enables on-policy continual learning from demonstrations. It uses the model as its own teacher to acquire new skills while preserving prior knowledge. This method significantly reduces catastrophic forgetting and allows models to accumulate multiple skills over time.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19897
• PDF: https://arxiv.org/pdf/2601.19897
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Self-Distillation Fine-Tuning enables on-policy continual learning from demonstrations. It uses the model as its own teacher to acquire new skills while preserving prior knowledge. This method significantly reduces catastrophic forgetting and allows models to accumulate multiple skills over time.
🔹 Publication Date: Published on Jan 27
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19897
• PDF: https://arxiv.org/pdf/2601.19897
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
🔥1
✨GDCNet: Generative Discrepancy Comparison Network for Multimodal Sarcasm Detection
📝 Summary:
A multimodal sarcasm detection approach uses generative models to create stable semantic anchors and measures cross-modal discrepancies for improved accuracy and robustness. AI-generated summary Multi...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20618
• PDF: https://arxiv.org/pdf/2601.20618
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
A multimodal sarcasm detection approach uses generative models to create stable semantic anchors and measures cross-modal discrepancies for improved accuracy and robustness. AI-generated summary Multi...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20618
• PDF: https://arxiv.org/pdf/2601.20618
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
❤1
✨Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation
📝 Summary:
MathForge enhances mathematical reasoning in large models through a dual framework combining difficulty-aware policy optimization and multi-aspect question reformulation to address limitations in exis...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20614
• PDF: https://arxiv.org/pdf/2601.20614
• Github: https://github.com/AMAP-ML/MathForge
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
MathForge enhances mathematical reasoning in large models through a dual framework combining difficulty-aware policy optimization and multi-aspect question reformulation to address limitations in exis...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20614
• PDF: https://arxiv.org/pdf/2601.20614
• Github: https://github.com/AMAP-ML/MathForge
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨RIR-Mega-Speech: A Reverberant Speech Corpus with Comprehensive Acoustic Metadata and Reproducible Evaluation
📝 Summary:
A large-scale reverberant speech corpus with detailed acoustic annotations is introduced to facilitate standardized comparison and reproduction of speech processing research. AI-generated summary Desp...
🔹 Publication Date: Published on Jan 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19949
• PDF: https://arxiv.org/pdf/2601.19949
• Project Page: https://huggingface.co/datasets/mandipgoswami/rir-mega-speech
✨ Datasets citing this paper:
• https://huggingface.co/datasets/mandipgoswami/rirmega
• https://huggingface.co/datasets/mandipgoswami/rir-mega-speech
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
A large-scale reverberant speech corpus with detailed acoustic annotations is introduced to facilitate standardized comparison and reproduction of speech processing research. AI-generated summary Desp...
🔹 Publication Date: Published on Jan 25
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.19949
• PDF: https://arxiv.org/pdf/2601.19949
• Project Page: https://huggingface.co/datasets/mandipgoswami/rir-mega-speech
✨ Datasets citing this paper:
• https://huggingface.co/datasets/mandipgoswami/rirmega
• https://huggingface.co/datasets/mandipgoswami/rir-mega-speech
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
✨Advancing Open-source World Models
📝 Summary:
LingBot-World is an open-source world simulator offering high-fidelity dynamics in diverse environments. It features long-term memory and real-time interactivity. This release empowers the community for applications like content creation, gaming, and robot learning.
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20540
• PDF: https://arxiv.org/pdf/2601.20540
• Project Page: https://technology.robbyant.com/lingbot-world
• Github: https://github.com/Robbyant/lingbot-world/
🔹 Models citing this paper:
• https://huggingface.co/robbyant/lingbot-world-base-cam
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
LingBot-World is an open-source world simulator offering high-fidelity dynamics in diverse environments. It features long-term memory and real-time interactivity. This release empowers the community for applications like content creation, gaming, and robot learning.
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20540
• PDF: https://arxiv.org/pdf/2601.20540
• Project Page: https://technology.robbyant.com/lingbot-world
• Github: https://github.com/Robbyant/lingbot-world/
🔹 Models citing this paper:
• https://huggingface.co/robbyant/lingbot-world-base-cam
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨SketchDynamics: Exploring Free-Form Sketches for Dynamic Intent Expression in Animation Generation
📝 Summary:
Free-form sketching enables intuitive dynamic intent communication for automated content creation, bridging human intention and digital output in animation workflows. AI-generated summary Sketching pr...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20622
• PDF: https://arxiv.org/pdf/2601.20622
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Free-form sketching enables intuitive dynamic intent communication for automated content creation, bridging human intention and digital output in animation workflows. AI-generated summary Sketching pr...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20622
• PDF: https://arxiv.org/pdf/2601.20622
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨DeepSeek-OCR 2: Visual Causal Flow
📝 Summary:
DeepSeek-OCR 2 introduces DeepEncoder V2 that dynamically reorders visual tokens based on semantic content, enabling more human-like causal reasoning in 2D image understanding through cascaded 1D caus...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20552
• PDF: https://arxiv.org/pdf/2601.20552
• Github: https://github.com/deepseek-ai/DeepSeek-OCR-2
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
DeepSeek-OCR 2 introduces DeepEncoder V2 that dynamically reorders visual tokens based on semantic content, enabling more human-like causal reasoning in 2D image understanding through cascaded 1D caus...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20552
• PDF: https://arxiv.org/pdf/2601.20552
• Github: https://github.com/deepseek-ai/DeepSeek-OCR-2
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning
📝 Summary:
Spark is a reinforcement learning framework that strategically allocates computational resources by branching at critical decision states, improving sample efficiency and generalization for long-horiz...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20209
• PDF: https://arxiv.org/pdf/2601.20209
🔹 Models citing this paper:
• https://huggingface.co/Jinyang23/Spark-1.5B-ALFWorld
• https://huggingface.co/Jinyang23/Spark-1.5B-ScienceWorld
• https://huggingface.co/Jinyang23/Spark-1.5B-WebShop
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Spark is a reinforcement learning framework that strategically allocates computational resources by branching at critical decision states, improving sample efficiency and generalization for long-horiz...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20209
• PDF: https://arxiv.org/pdf/2601.20209
🔹 Models citing this paper:
• https://huggingface.co/Jinyang23/Spark-1.5B-ALFWorld
• https://huggingface.co/Jinyang23/Spark-1.5B-ScienceWorld
• https://huggingface.co/Jinyang23/Spark-1.5B-WebShop
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Linear representations in language models can change dramatically over a conversation
📝 Summary:
Linear representation directions in language models dynamically shift during conversations, affecting how factual information is encoded while preserving generic content, with implications for interpr...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20834
• PDF: https://arxiv.org/pdf/2601.20834
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Linear representation directions in language models dynamically shift during conversations, affecting how factual information is encoded while preserving generic content, with implications for interpr...
🔹 Publication Date: Published on Jan 28
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.20834
• PDF: https://arxiv.org/pdf/2601.20834
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research