✨NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation
📝 Summary:
NextFlow is a unified decoder-only transformer for fast multimodal understanding and generation. It uses next-token prediction for text and next-scale prediction for images, generating 1024x1024 images in 5 seconds. It achieves state-of-the-art performance among unified models.
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02204
• PDF: https://arxiv.org/pdf/2601.02204
• Github: https://github.com/ByteVisionLab/NextFlow
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits
📝 Summary:
Large language models (LLMs) generate fluent and complex outputs but often fail to recognize their own mistakes and hallucinations. Existing approaches typically rely on external judges, multi-sample ...
🔹 Publication Date: Published on Dec 23, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.20578
• PDF: https://arxiv.org/pdf/2512.20578
• Github: https://github.com/Amirhosein-gh98/Gnosis
🔹 Models citing this paper:
• https://huggingface.co/AmirhoseinGH/Gnosis-Qwen3-1.7B-Hybrid
• https://huggingface.co/AmirhoseinGH/Gnosis-Qwen3-4B-Instruct-2507
• https://huggingface.co/AmirhoseinGH/Gnosis-Qwen3-4B-Thinking-2507
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation
📝 Summary:
Visual autoregressive models face training instability due to asynchronous policy conflicts, which are addressed through a novel framework enhancing group relative policy optimization with intermediat...
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02256
• PDF: https://arxiv.org/pdf/2601.02256
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
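The summary above mentions group relative policy optimization (GRPO). As background only, and not as this paper's method, GRPO-style training typically scores each sample against the other samples drawn for the same prompt. A minimal sketch, assuming scalar rewards and a population standard deviation (the function name and the `eps` parameter are hypothetical):

```python
from statistics import mean, pstdev
from typing import List


def group_relative_advantages(rewards: List[float], eps: float = 1e-8) -> List[float]:
    """Normalize each reward by the mean and std of its own sampling group.

    Assumption: one group = all samples generated for a single prompt.
    eps keeps the division finite when every reward in the group is equal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; a sample std is an equally common choice
    return [(r - mu) / (sigma + eps) for r in rewards]


if __name__ == "__main__":
    # Four samples for one prompt: two rewarded, two not.
    print(group_relative_advantages([1.0, 0.0, 1.0, 0.0]))
```

Samples that beat their group's average get a positive advantage and the rest get a negative one, so no separate value network is needed to form the baseline.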
✨Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes
📝 Summary:
Talk2Move presents a reinforcement learning-based diffusion framework that enables precise, semantically faithful spatial transformations of objects in scenes using natural language instructions.
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02356
• PDF: https://arxiv.org/pdf/2601.02356
• Project Page: https://sparkstj.github.io/talk2move/
• Github: https://github.com/sparkstj/Talk2Move
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs
📝 Summary:
KV-Embedding enables training-free representation learning from frozen LLMs by utilizing key-value states for enhanced context access and automated layer selection.
🔹 Publication Date: Published on Jan 3, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01046
• PDF: https://arxiv.org/pdf/2601.01046
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨VINO: A Unified Visual Generator with Interleaved OmniModal Context
📝 Summary:
VINO is a unified visual generator that uses a shared diffusion backbone with multimodal inputs to perform image and video generation and editing tasks.
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02358
• PDF: https://arxiv.org/pdf/2601.02358
• Project Page: https://sotamak1r.github.io/VINO-web/
• Github: https://github.com/SOTAMak1r/VINO-code
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨K-EXAONE Technical Report
📝 Summary:
K-EXAONE is a multilingual language model with a Mixture-of-Experts architecture that achieves competitive performance on various benchmarks while supporting multiple languages and long-context window...
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01739
• PDF: https://arxiv.org/pdf/2601.01739
• Github: https://github.com/LG-AI-EXAONE/K-EXAONE
🔹 Models citing this paper:
• https://huggingface.co/LGAI-EXAONE/K-EXAONE-236B-A23B
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling
📝 Summary:
Falcon-H1R is a 7B-parameter language model that achieves competitive reasoning performance through efficient training strategies and architectural design, enabling scalable reasoning capabilities in ...
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02346
• PDF: https://arxiv.org/pdf/2601.02346
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment
📝 Summary:
OpenNovelty is an LLM-powered agentic system for verifiable scholarly novelty assessment in peer review. It retrieves and analyzes prior work via semantic search and taxonomy construction, generating evidence-backed reports grounded in real papers. This tool aims to promote fair, consistent, and ...
🔹 Publication Date: Published on Jan 4, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01576
• PDF: https://arxiv.org/pdf/2601.01576
• Project Page: https://www.opennovelty.org/
• Github: https://github.com/january-blue/OpenNovelty
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
📝 Summary:
COMPASS evaluates large language models' compliance with organizational policies, revealing significant gaps in enforcing prohibitions despite strong performance on legitimate requests.
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01836
• PDF: https://arxiv.org/pdf/2601.01836
• Github: https://github.com/AIM-Intelligence/COMPASS
🔹 Models citing this paper:
• https://huggingface.co/AIM-Intelligence/COMPASS_Qwen2.5-7B-Instruct_LoRA
• https://huggingface.co/AIM-Intelligence/COMPASS_gemma-3-4b-it_LoRA
🔹 Datasets citing this paper:
• https://huggingface.co/datasets/AIM-Intelligence/COMPASS-Policy-Alignment-Testbed-Dataset
• https://huggingface.co/datasets/AIM-Intelligence/COMPASS-Policy-aware-SFT-Dataset
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents
📝 Summary:
Project Ariadne uses structural causal models and counterfactual logic to evaluate the causal integrity of LLM reasoning, revealing a faithfulness gap where reasoning traces are not reliable drivers o...
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02314
• PDF: https://arxiv.org/pdf/2601.02314
• Github: https://github.com/skhanzad/AridadneXAI
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨GARDO: Reinforcing Diffusion Models without Reward Hacking
📝 Summary:
Online reinforcement learning for diffusion model fine-tuning suffers from reward hacking due to proxy reward mismatches, which GARDO addresses through selective regularization, adaptive reference upd...
🔹 Publication Date: Published on Dec 30, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.24138
• PDF: https://arxiv.org/pdf/2512.24138
• Project Page: https://tinnerhrhe.github.io/gardo_project/
• Github: https://github.com/tinnerhrhe/gardo
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨IMA++: ISIC Archive Multi-Annotator Dermoscopic Skin Lesion Segmentation Dataset
📝 Summary:
A large-scale public multi-annotator skin lesion segmentation dataset is introduced with extensive metadata for annotator analysis and consensus modeling.
🔹 Publication Date: Published on Dec 25, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.21472
• PDF: https://arxiv.org/pdf/2512.21472
• Github: https://github.com/sfu-mial/IMAplusplus
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Toward Stable Semi-Supervised Remote Sensing Segmentation via Co-Guidance and Co-Fusion
📝 Summary:
A semi-supervised remote sensing image segmentation framework combines vision-language and self-supervised models to reduce pseudo-label drift through dual-student architecture and semantic co-guidanc...
🔹 Publication Date: Published on Dec 28, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.23035
• PDF: https://arxiv.org/pdf/2512.23035
• Project Page: https://xavierjiezou.github.io/Co2S/
• Github: https://github.com/XavierJiezou/Co2S
==================================
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Recursive Language Models
📝 Summary:
Recursive Language Models (RLMs) allow LLMs to process arbitrarily long prompts. RLMs programmatically decompose prompts and recursively call the LLM over snippets. This extends input length 100x and improves performance, even for shorter prompts, at similar cost.
🔹 Publication Date: Published on Dec 31, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.24601
• PDF: https://arxiv.org/pdf/2512.24601
• Github: https://github.com/alexzhang13/rlm/tree/main
==================================
#LLMs #AI #NLP #RecursiveLMs #LongContext
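The Recursive Language Models summary above describes decomposing a prompt and recursively calling the LLM over snippets. A minimal sketch of that general idea, assuming fixed-size character chunks and a caller-supplied `llm` function (both are illustrative assumptions; this is not the paper's implementation or the linked repository's API):

```python
from typing import Callable


def recursive_call(
    text: str,
    question: str,
    llm: Callable[[str], str],  # hypothetical: takes a prompt, returns a completion
    max_chars: int = 2000,      # assumed per-call budget, measured in characters
    depth: int = 0,
    max_depth: int = 4,         # hard stop so the recursion always terminates
) -> str:
    """Answer `question` over `text`, splitting and recursing when text is too long."""
    # Base case: the text fits in one call, or the depth budget is spent.
    if len(text) <= max_chars or depth >= max_depth:
        return llm(f"{question}\n\n{text[:max_chars]}")

    # Decompose the long input into snippets that each fit in one call.
    snippets = [text[i:i + max_chars] for i in range(0, len(text), max_chars)]

    # Call the model over each snippet, then recurse over the joined partial answers.
    partials = [
        recursive_call(s, question, llm, max_chars, depth + 1, max_depth)
        for s in snippets
    ]
    return recursive_call("\n".join(partials), question, llm, max_chars, depth + 1, max_depth)


if __name__ == "__main__":
    # Stand-in model that just reports how much text it was given.
    echo = lambda prompt: f"saw {len(prompt)} chars"
    print(recursive_call("lorem ipsum " * 500, "What is this about?", echo))
```

The depth limit trades completeness for a guaranteed stop: if the partial answers never shrink below `max_chars`, the final call truncates them instead of recursing further.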
✨InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams
📝 Summary:
InfiniteVGGT enables continuous 3D visual geometry understanding for infinite streams. It uses a causal transformer with adaptive rolling memory for long-term stability, outperforming existing streaming methods. A new Long3D benchmark is introduced for rigorous evaluation of such systems.
🔹 Publication Date: Published on Jan 5, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02281
• PDF: https://arxiv.org/pdf/2601.02281
• Github: https://github.com/AutoLab-SAI-SJTU/InfiniteVGGT
==================================
#VisualGeometry #3DVision #Transformers #StreamingAI #DeepLearning
✨SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving
📝 Summary:
SWE-Lego achieves state-of-the-art software issue resolution through a lightweight supervised fine-tuning approach. It uses a high-quality dataset and refined training procedures like error masking and a difficulty-based curriculum, outperforming complex methods. Performance is further boosted by...
🔹 Publication Date: Published on Jan 4, 2026
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01426
• PDF: https://arxiv.org/pdf/2601.01426
• Project Page: https://github.com/SWE-Lego/SWE-Lego
• Github: https://github.com/SWE-Lego/SWE-Lego
🔹 Models citing this paper:
• https://huggingface.co/SWE-Lego/SWE-Lego-Qwen3-8B
• https://huggingface.co/SWE-Lego/SWE-Lego-Qwen3-32B
🔹 Datasets citing this paper:
• https://huggingface.co/datasets/SWE-Lego/SWE-Lego-Real-Data
• https://huggingface.co/datasets/SWE-Lego/SWE-Lego-Synthetic-Data
==================================
#SoftwareEngineering #MachineLearning #LLM #FineTuning #AIforCode
✨M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models
📝 Summary:
Existing concept erasure methods in diffusion models are vulnerable to non-text inputs. M-ErasureBench is a new multimodal evaluation framework, and IRECE is a module to restore robustness against these attacks, reducing concept reproduction.
🔹 Publication Date: Published on Dec 28, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2512.22877
• PDF: https://arxiv.org/pdf/2512.22877
==================================
#DiffusionModels #ConceptErasure #MultimodalAI #AISafety #MachineLearning