✨G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design
📝 Summary:
A generative evolutionary framework extends large language models for automated design of large neighborhood search operators in combinatorial optimization problems.
🔹 Publication Date: Published on Feb 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.08253
• PDF: https://arxiv.org/pdf/2602.08253
• Project Page: https://zboyn.github.io/G-LNS/
• Github: https://github.com/ZBoyn/G-LNS
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models
📝 Summary:
Meta-Experience Learning enhances LLM reasoning by incorporating self-distilled error representations into parametric memory through contrastive trajectory analysis and language-modeled reward signals...
🔹 Publication Date: Published on Feb 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10224
• PDF: https://arxiv.org/pdf/2602.10224
==================================
✨Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
📝 Summary:
Step 3.5 Flash is a sparse Mixture-of-Experts model that achieves frontier-level agentic intelligence through efficient parameter utilization and optimized attention mechanisms, demonstrating strong p...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10604
• PDF: https://arxiv.org/pdf/2602.10604
• Github: https://github.com/stepfun-ai/Step-3.5-Flash
==================================
✨When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
📝 Summary:
Visual-to-visual jailbreak attacks compromise image editing models through malicious visual inputs, necessitating new safety benchmarks and defense mechanisms.
🔹 Publication Date: Published on Feb 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10179
• PDF: https://arxiv.org/pdf/2602.10179
• Project Page: https://csu-jpg.github.io/vja.github.io/
==================================
✨ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation
📝 Summary:
ArcFlow is a few-step distillation framework that uses non-linear flow trajectories to approximate teacher diffusion models, achieving fast inference with minimal quality loss through lightweight adap...
🔹 Publication Date: Published on Feb 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.09014
• PDF: https://arxiv.org/pdf/2602.09014
• Github: https://github.com/pnotp/ArcFlow
🔹 Models citing this paper:
• https://huggingface.co/ymyy307/ArcFlow
==================================
✨When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
📝 Summary:
Computer-use agents face safety risks from misaligned actions caused by external attacks or internal limitations, prompting the development of DeAction, a guardrail that detects and corrects such acti...
🔹 Publication Date: Published on Feb 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.08995
• PDF: https://arxiv.org/pdf/2602.08995
• Project Page: https://osu-nlp-group.github.io/Misaligned-Action-Detection/
• Github: https://github.com/OSU-NLP-Group/Misaligned-Action-Detection
🔹 Datasets citing this paper:
• https://huggingface.co/datasets/osunlp/MisActBench
==================================
✨AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions
📝 Summary:
AgenticPay presents a benchmark and simulation framework for evaluating multi-agent language-mediated economic interactions, focusing on negotiation performance and strategic reasoning challenges in c...
🔹 Publication Date: Published on Feb 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.06008
• PDF: https://arxiv.org/pdf/2602.06008
• Project Page: https://agenticpay-tutorial.readthedocs.io/en/latest/
• Github: https://github.com/SafeRL-Lab/AgenticPay
==================================
✨When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning
📝 Summary:
GRU-Mem addresses long-context reasoning challenges in LLMs by incorporating text-controlled gates and reinforcement learning rewards to stabilize memory updates and improve computational efficiency. ...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10560
• PDF: https://arxiv.org/pdf/2602.10560
• Project Page: https://alphalab-ustc.github.io/grumem-alphalab/
==================================
✨Towards Autonomous Mathematics Research
📝 Summary:
Aletheia, a math research agent, demonstrates advanced reasoning capabilities by generating and verifying solutions end-to-end in natural language, achieving autonomous research outcomes from Olympiad...
🔹 Publication Date: Published on Feb 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10177
• PDF: https://arxiv.org/pdf/2602.10177
==================================
✨CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion
📝 Summary:
CLI-Gym enables scalable derivation of environment-intensive tasks by simulating and exploring environment histories, while LiberCoder achieves significant performance improvements on Terminal-Bench t...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10999
• PDF: https://arxiv.org/pdf/2602.10999
• Github: https://github.com/LiberCoders/CLI-Gym
==================================
✨QP-OneModel: A Unified Generative LLM for Multi-Task Query Understanding in Xiaohongshu Search
📝 Summary:
A unified generative large language model approach for social network search query processing that improves semantic understanding through multi-task learning and reinforcement learning while enhancin...
🔹 Publication Date: Published on Feb 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.09901
• PDF: https://arxiv.org/pdf/2602.09901
==================================
✨Online Causal Kalman Filtering for Stable and Effective Policy Optimization
📝 Summary:
Online Causal Kalman Filtering addresses high-variance token-level importance sampling in reinforcement learning for large language models by modeling IS ratios as evolving latent states and using Kal...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10609
• PDF: https://arxiv.org/pdf/2602.10609
==================================
✨GENIUS: Generative Fluid Intelligence Evaluation Suite
📝 Summary:
GENIUS evaluates multimodal models' generative fluid intelligence through pattern induction, constraint execution, and contextual adaptation tasks, revealing deficiencies in context comprehension rath...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.11144
• PDF: https://arxiv.org/pdf/2602.11144
==================================
✨DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning
📝 Summary:
DataChef-32B automates data recipe generation for LLM adaptation through reinforcement learning with proxy rewards, achieving performance comparable to human-crafted recipes.
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.11089
• PDF: https://arxiv.org/pdf/2602.11089
• Github: https://github.com/yichengchen24/DataChef
==================================
✨UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory
📝 Summary:
A unified framework for memory extraction and management in LLM-based agents that improves generalization through semantic neighborhood modeling and marginal utility rewards.
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10652
• PDF: https://arxiv.org/pdf/2602.10652
• Github: https://github.com/AIDC-AI/Marco-DeepResearch
==================================
✨Spend Search Where It Pays: Value-Guided Structured Sampling and Optimization for Generative Recommendation
📝 Summary:
V-STAR improves generative recommendation by addressing the probability-reward mismatch that causes poor exploration and weak learning signals. It uses value-guided decoding for efficient exploration and sibling-relative advantages to focus reinforcement learning. This framework enhances accuracy...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10699
• PDF: https://arxiv.org/pdf/2602.10699
==================================
✨PhyCritic: Multimodal Critic Models for Physical AI
📝 Summary:
PhyCritic is a multimodal critic model designed for physical AI tasks through a two-stage RLVR pipeline that enhances perception and reasoning capabilities.
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.11124
• PDF: https://arxiv.org/pdf/2602.11124
• Project Page: https://research.nvidia.com/labs/lpr/phycritic
==================================
✨FeatureBench: Benchmarking Agentic Coding for Complex Feature Development
📝 Summary:
FeatureBench evaluates agentic coding performance in comprehensive feature-oriented development through execution-based assessments and automated task derivation from code repositories.
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.10975
• PDF: https://arxiv.org/pdf/2602.10975
• Project Page: https://libercoders.github.io/FeatureBench/
• Github: https://github.com/LiberCoders/FeatureBench
==================================
✨TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
📝 Summary:
Omni Dense Captioning introduces a six-dimensional structural schema for generating time-aware audio-visual narratives with explicit timestamps, along with a unified evaluation metric and strong basel...
🔹 Publication Date: Published on Feb 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.08711
• PDF: https://arxiv.org/pdf/2602.08711
• Github: https://github.com/yaolinli/TimeChat-Captioner
🔹 Models citing this paper:
• https://huggingface.co/yaolily/TimeChat-Captioner-GRPO-7B
🔹 Datasets citing this paper:
• https://huggingface.co/datasets/yaolily/TimeChat-OmniCap-42K
• https://huggingface.co/datasets/yaolily/OmniDCBench
==================================
✨ASA: Training-Free Representation Engineering for Tool-Calling Agents
📝 Summary:
A training-free method called Activation Steering Adapter corrects tool calling behavior in language models by using mid-layer activation interventions guided by a probe and router-conditioned steerin...
🔹 Publication Date: Published on Feb 4
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.04935
• PDF: https://arxiv.org/pdf/2602.04935
==================================
✨Free(): Learning to Forget in Malloc-Only Reasoning Models
📝 Summary:
Free()LM addresses reasoning model limitations by introducing a self-forgetting mechanism through a Free-Module plug-and-play LoRA adapter, improving performance across scales and long-horizon tasks. ...
🔹 Publication Date: Published on Feb 8
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.08030
• PDF: https://arxiv.org/pdf/2602.08030
• Github: https://github.com/TemporaryLoRA/FreeLM
🔹 Models citing this paper:
• https://huggingface.co/ldsjmdy/Qwen3-8B-FreeLM-LoRA
• https://huggingface.co/ldsjmdy/Qwen3-30B-A3B-Thinking-2507-FreeLM-LoRA
• https://huggingface.co/ldsjmdy/Qwen3-235B-A22B-Thinking-2507-FreeLM-LoRA
🔹 Datasets citing this paper:
• https://huggingface.co/datasets/ldsjmdy/FreeLM
==================================