✨REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
📝 Summary:
REDSearcher presents a unified framework for optimizing search agents through improved task synthesis, tool-augmented queries, midtraining capability enhancement, and simulated environments to address...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14234
• PDF: https://arxiv.org/pdf/2602.14234
• Project Page: https://redsearchagent.github.io/index/
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
REDSearcher presents a unified framework for optimizing search agents through improved task synthesis, tool-augmented queries, midtraining capability enhancement, and simulated environments to address...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14234
• PDF: https://arxiv.org/pdf/2602.14234
• Project Page: https://redsearchagent.github.io/index/
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings
📝 Summary:
A reasoning-driven universal multimodal embedding framework integrates embedder-guided reinforcement learning with traceability chain-of-thought to enhance cross-modal semantic consistency and retriev...
🔹 Publication Date: Published on Feb 14
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13823
• PDF: https://arxiv.org/pdf/2602.13823
• Github: https://github.com/ZoengHN/Embed-RL
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
A reasoning-driven universal multimodal embedding framework integrates embedder-guided reinforcement learning with traceability chain-of-thought to enhance cross-modal semantic consistency and retriev...
🔹 Publication Date: Published on Feb 14
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13823
• PDF: https://arxiv.org/pdf/2602.13823
• Github: https://github.com/ZoengHN/Embed-RL
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^{128} for Unified Multimodal Large Language Model
📝 Summary:
UniWeTok introduces a unified discrete tokenizer with a massive binary codebook and novel training techniques to achieve superior performance in image generation and multimodal tasks while reducing co...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14178
• PDF: https://arxiv.org/pdf/2602.14178
• Github: https://github.com/shallowdream204/BitDance
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
UniWeTok introduces a unified discrete tokenizer with a massive binary codebook and novel training techniques to achieve superior performance in image generation and multimodal tasks while reducing co...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14178
• PDF: https://arxiv.org/pdf/2602.14178
• Github: https://github.com/shallowdream204/BitDance
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨LaViDa-R1: Advancing Reasoning for Unified Multimodal Diffusion Language Models
📝 Summary:
LaViDa-R1 is a multimodal reasoning diffusion language model that unifies supervised fine-tuning and multi-task reinforcement learning with novel training techniques for enhanced performance across vi...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14147
• PDF: https://arxiv.org/pdf/2602.14147
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
LaViDa-R1 is a multimodal reasoning diffusion language model that unifies supervised fine-tuning and multi-task reinforcement learning with novel training techniques for enhanced performance across vi...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14147
• PDF: https://arxiv.org/pdf/2602.14147
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨BrowseComp-V^3: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents
📝 Summary:
A new benchmark called BrowseComp-V3 challenges multimodal large language models with complex, multi-hop reasoning tasks requiring deep search across text and visual modalities, revealing significant ...
🔹 Publication Date: Published on Feb 13
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.12876
• PDF: https://arxiv.org/pdf/2602.12876
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
A new benchmark called BrowseComp-V3 challenges multimodal large language models with complex, multi-hop reasoning tasks requiring deep search across text and visual modalities, revealing significant ...
🔹 Publication Date: Published on Feb 13
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.12876
• PDF: https://arxiv.org/pdf/2602.12876
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨FireRed-Image-Edit-1.0 Techinical Report
📝 Summary:
FireRed-Image-Edit uses a diffusion transformer with optimized data curation and training methods to achieve state-of-the-art performance in instruction-based image editing, supported by a comprehensi...
🔹 Publication Date: Published on Feb 12
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13344
• PDF: https://arxiv.org/pdf/2602.13344
• Project Page: https://huggingface.co/spaces/FireRedTeam/FireRed-Image-Edit-1.0
• Github: https://github.com/FireRedTeam/FireRed-Image-Edit
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
FireRed-Image-Edit uses a diffusion transformer with optimized data curation and training methods to achieve state-of-the-art performance in instruction-based image editing, supported by a comprehensi...
🔹 Publication Date: Published on Feb 12
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13344
• PDF: https://arxiv.org/pdf/2602.13344
• Project Page: https://huggingface.co/spaces/FireRedTeam/FireRed-Image-Edit-1.0
• Github: https://github.com/FireRedTeam/FireRed-Image-Edit
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨AIDev: Studying AI Coding Agents on GitHub
📝 Summary:
AIDev is a large-scale dataset of agent-authored pull requests from real-world GitHub repositories that captures AI coding agent usage in practical software development scenarios. AI-generated summary...
🔹 Publication Date: Published on Feb 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.15003
• PDF: https://arxiv.org/pdf/2602.09185
• Project Page: https://huggingface.co/datasets/hao-li/AIDev
• Github: https://huggingface.co/papers?q=GitHub%20repositories
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
AIDev is a large-scale dataset of agent-authored pull requests from real-world GitHub repositories that captures AI coding agent usage in practical software development scenarios. AI-generated summary...
🔹 Publication Date: Published on Feb 9
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.15003
• PDF: https://arxiv.org/pdf/2602.09185
• Project Page: https://huggingface.co/datasets/hao-li/AIDev
• Github: https://huggingface.co/papers?q=GitHub%20repositories
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)
📝 Summary:
Targeted instruction selection for LLM fine-tuning can be improved by systematically analyzing data representation and selection algorithms, with gradient-based representations and greedy round-robin ...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14696
• PDF: https://arxiv.org/pdf/2602.14696
• Github: https://github.com/dcml-lab/targeted-instruction-selection
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Targeted instruction selection for LLM fine-tuning can be improved by systematically analyzing data representation and selection algorithms, with gradient-based representations and greedy round-robin ...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14696
• PDF: https://arxiv.org/pdf/2602.14696
• Github: https://github.com/dcml-lab/targeted-instruction-selection
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨BitDance: Scaling Autoregressive Generative Models with Binary Tokens
📝 Summary:
BitDance is a scalable autoregressive image generator using binary visual tokens and a binary diffusion head. It introduces next-patch diffusion for parallel token prediction, significantly improving inference speed and achieving state-of-the-art performance with fewer parameters.
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14041
• PDF: https://arxiv.org/pdf/2602.14041
• Github: https://github.com/shallowdream204/BitDance
🔹 Models citing this paper:
• https://huggingface.co/shallowdream204/BitDance-14B-16x
• https://huggingface.co/shallowdream204/BitDance-14B-64x
• https://huggingface.co/shallowdream204/BitDance-ImageNet
✨ Spaces citing this paper:
• https://huggingface.co/spaces/shallowdream204/BitDance-14B-64x
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
BitDance is a scalable autoregressive image generator using binary visual tokens and a binary diffusion head. It introduces next-patch diffusion for parallel token prediction, significantly improving inference speed and achieving state-of-the-art performance with fewer parameters.
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14041
• PDF: https://arxiv.org/pdf/2602.14041
• Github: https://github.com/shallowdream204/BitDance
🔹 Models citing this paper:
• https://huggingface.co/shallowdream204/BitDance-14B-16x
• https://huggingface.co/shallowdream204/BitDance-14B-64x
• https://huggingface.co/shallowdream204/BitDance-ImageNet
✨ Spaces citing this paper:
• https://huggingface.co/spaces/shallowdream204/BitDance-14B-64x
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
BitDance: Scaling Autoregressive Generative Models with Binary Tokens
We present BitDance, a scalable autoregressive (AR) image generator that predicts binary visual tokens instead of codebook indices. With high-entropy binary latents, BitDance lets each token...
✨WebWorld: A Large-Scale World Model for Web Agent Training
📝 Summary:
WebWorld is an open-web simulator trained on over one million interactions that supports long-horizon reasoning and multi-format data, achieving performance comparable to advanced models like Gemini-3...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14721
• PDF: https://arxiv.org/pdf/2602.14721
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
WebWorld is an open-web simulator trained on over one million interactions that supports long-horizon reasoning and multi-format data, achieving performance comparable to advanced models like Gemini-3...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14721
• PDF: https://arxiv.org/pdf/2602.14721
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation
📝 Summary:
MoRL is a unified multimodal motion model using reinforcement learning with verifiable rewards. It significantly improves human motion understanding and generation through enhanced semantic alignment, reasoning, and physical plausibility, outperforming baselines.
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14534
• PDF: https://arxiv.org/pdf/2602.14534
• Project Page: https://aigeeksgroup.github.io/MoRL/
• Github: https://aigeeksgroup.github.io/MoRL/
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
MoRL is a unified multimodal motion model using reinforcement learning with verifiable rewards. It significantly improves human motion understanding and generation through enhanced semantic alignment, reasoning, and physical plausibility, outperforming baselines.
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14534
• PDF: https://arxiv.org/pdf/2602.14534
• Project Page: https://aigeeksgroup.github.io/MoRL/
• Github: https://aigeeksgroup.github.io/MoRL/
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Preliminary sonification of ENSO using traditional Javanese gamelan scales
📝 Summary:
Parameter-mapping sonification of ENSO data preserves dynamical signatures through acoustic phase space analysis, revealing distinct coupling regimes in traditional musical scales. AI-generated summar...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14560
• PDF: https://arxiv.org/pdf/2602.14560
• Project Page: https://doi.org/10.17605/OSF.IO/QY82M
• Github: https://github.com/sandyherho/suppl-enso-javanese-sonification
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Parameter-mapping sonification of ENSO data preserves dynamical signatures through acoustic phase space analysis, revealing distinct coupling regimes in traditional musical scales. AI-generated summar...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14560
• PDF: https://arxiv.org/pdf/2602.14560
• Project Page: https://doi.org/10.17605/OSF.IO/QY82M
• Github: https://github.com/sandyherho/suppl-enso-javanese-sonification
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Query as Anchor: Scenario-Adaptive User Representation via Large Language Model
📝 Summary:
Query-as-Anchor is a novel framework shifting user modeling from static encoding to dynamic query-aware synthesis using large language models. It employs specialized architecture and training, achieving state-of-the-art performance and efficient deployment in industrial settings.
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14492
• PDF: https://arxiv.org/pdf/2602.14492
• Github: https://github.com/JhCircle/Q-Anchor
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Query-as-Anchor is a novel framework shifting user modeling from static encoding to dynamic query-aware synthesis using large language models. It employs specialized architecture and training, achieving state-of-the-art performance and efficient deployment in industrial settings.
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14492
• PDF: https://arxiv.org/pdf/2602.14492
• Github: https://github.com/JhCircle/Q-Anchor
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Acoustivision Pro: An Open-Source Interactive Platform for Room Impulse Response Analysis and Acoustic Characterization
📝 Summary:
Room acoustics analysis plays a central role in architectural design, audio engineering, speech intelligibility assessment, and hearing research. Despite the availability of standardized metrics such ...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.12299
• PDF: https://arxiv.org/pdf/2602.12299
• Project Page: https://huggingface.co/spaces/mandipgoswami/acoustivision-pro
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Room acoustics analysis plays a central role in architectural design, audio engineering, speech intelligibility assessment, and hearing research. Despite the availability of standardized metrics such ...
🔹 Publication Date: Published on Feb 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.12299
• PDF: https://arxiv.org/pdf/2602.12299
• Project Page: https://huggingface.co/spaces/mandipgoswami/acoustivision-pro
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Media is too big
VIEW IN TELEGRAM
✨Conversational Image Segmentation: Grounding Abstract Concepts with Scalable Supervision
📝 Summary:
Conversational image segmentation addresses functional and physical reasoning tasks by introducing a new benchmark and model that combines segmentation priors with language understanding. AI-generated...
🔹 Publication Date: Published on Feb 13
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13195
• PDF: https://arxiv.org/pdf/2602.13195
• Project Page: https://glab-caltech.github.io/converseg/
• Github: https://github.com/AadSah/ConverSeg
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Conversational image segmentation addresses functional and physical reasoning tasks by introducing a new benchmark and model that combines segmentation priors with language understanding. AI-generated...
🔹 Publication Date: Published on Feb 13
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13195
• PDF: https://arxiv.org/pdf/2602.13195
• Project Page: https://glab-caltech.github.io/converseg/
• Github: https://github.com/AadSah/ConverSeg
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Experiential Reinforcement Learning
📝 Summary:
Experiential Reinforcement Learning ERL addresses challenges in sparse-reward environments by embedding an explicit experience-reflection-consolidation loop. This process converts feedback into structured behavioral revision, significantly improving learning efficiency and performance without add...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13949
• PDF: https://arxiv.org/pdf/2602.13949
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#ReinforcementLearning #MachineLearning #AI #ERL #SparseRewards
📝 Summary:
Experiential Reinforcement Learning ERL addresses challenges in sparse-reward environments by embedding an explicit experience-reflection-consolidation loop. This process converts feedback into structured behavioral revision, significantly improving learning efficiency and performance without add...
🔹 Publication Date: Published on Feb 15
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.13949
• PDF: https://arxiv.org/pdf/2602.13949
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#ReinforcementLearning #MachineLearning #AI #ERL #SparseRewards
✨Exposing the Systematic Vulnerability of Open-Weight Models to Prefill Attacks
📝 Summary:
A study reveals prefill attacks as a critical, underexplored vulnerability in open-weight language models. These attacks, which predefine initial response tokens, consistently compromise major models, necessitating urgent defense development.
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14689
• PDF: https://arxiv.org/pdf/2602.14689
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#PrefillAttacks #LLMSecurity #AIvulnerability #OpenWeightModels #LanguageModels
📝 Summary:
A study reveals prefill attacks as a critical, underexplored vulnerability in open-weight language models. These attacks, which predefine initial response tokens, consistently compromise major models, necessitating urgent defense development.
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14689
• PDF: https://arxiv.org/pdf/2602.14689
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#PrefillAttacks #LLMSecurity #AIvulnerability #OpenWeightModels #LanguageModels
This media is not supported in your browser
VIEW IN TELEGRAM
✨InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem
📝 Summary:
InnoEval offers a new framework for evaluating research ideas, addressing the limitations of current methods. It uses knowledge-grounded, multi-perspective reasoning, employing deep knowledge search and an innovation review board for multi-dimensional assessment. It outperforms baselines and alig...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14367
• PDF: https://arxiv.org/pdf/2602.14367
• Project Page: https://innoeval.zjukg.cn/
• Github: https://github.com/zjunlp/InnoEval
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#ResearchEvaluation #KnowledgeReasoning #AI #Innovation #NLP
📝 Summary:
InnoEval offers a new framework for evaluating research ideas, addressing the limitations of current methods. It uses knowledge-grounded, multi-perspective reasoning, employing deep knowledge search and an innovation review board for multi-dimensional assessment. It outperforms baselines and alig...
🔹 Publication Date: Published on Feb 16
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.14367
• PDF: https://arxiv.org/pdf/2602.14367
• Project Page: https://innoeval.zjukg.cn/
• Github: https://github.com/zjunlp/InnoEval
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#ResearchEvaluation #KnowledgeReasoning #AI #Innovation #NLP
✨Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation
📝 Summary:
This paper introduces the first systematic benchmark for evaluating knowledge-extraction attacks and defenses on Retrieval-Augmented Generation systems. It standardizes testing across diverse models and strategies to enable comparable evaluation and help build privacy-preserving RAG.
🔹 Publication Date: Published on Feb 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.09319
• PDF: https://arxiv.org/pdf/2602.09319
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#RAG #KnowledgeExtraction #Cybersecurity #AIPrivacy #Benchmarking
📝 Summary:
This paper introduces the first systematic benchmark for evaluating knowledge-extraction attacks and defenses on Retrieval-Augmented Generation systems. It standardizes testing across diverse models and strategies to enable comparable evaluation and help build privacy-preserving RAG.
🔹 Publication Date: Published on Feb 10
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.09319
• PDF: https://arxiv.org/pdf/2602.09319
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#RAG #KnowledgeExtraction #Cybersecurity #AIPrivacy #Benchmarking
✨Blind to the Human Touch: Overlap Bias in LLM-Based Summary Evaluation
📝 Summary:
LLM judges show bias, increasingly preferring AI-generated summaries over human ones as similarity to human references decreases. This widespread bias across models suggests LLM-as-a-judge needs more sophisticated evaluation beyond simple comparison.
🔹 Publication Date: Published on Feb 7
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.07673
• PDF: https://arxiv.org/pdf/2602.07673
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#LLM #AIbias #AIEvaluation #NLP #AIethics
📝 Summary:
LLM judges show bias, increasingly preferring AI-generated summaries over human ones as similarity to human references decreases. This widespread bias across models suggests LLM-as-a-judge needs more sophisticated evaluation beyond simple comparison.
🔹 Publication Date: Published on Feb 7
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2602.07673
• PDF: https://arxiv.org/pdf/2602.07673
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#LLM #AIbias #AIEvaluation #NLP #AIethics