ML Research Hub
32.3K subscribers
6.84K photos
486 videos
24 files
7.46K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
Today, the public mint for Lobsters on TON goes live on Getgems 🦞

This is not just another NFT drop.
In my view, Lobsters is one of the first truly cohesive products at the intersection of blockchain, NFTs, and AI.

Here, the NFT is not just an image and not just a collectible.
Each Lobster is an NFT with a built-in AI agent inside: a digital character with its own soul, on-chain biography, persistent memory, and a unified identity across Telegram, Mini App, Claude, and API.

So you are not just getting an asset in your wallet.
You are getting an AI-native digital character that can interact, remember, and stay consistent across different interfaces.

What makes this especially interesting is the timing.

In the recent video Pavel Durov shared in his post about agentic bots in Telegram, the lobster imagery was right there. Against that backdrop, Lobsters does not feel like a random mint β€” it feels like a very precise fit for the new narrative:

Telegram-native agents + TON infrastructure + NFT ownership layer + AI utility

Put simply, this is one of the first real attempts to turn an NFT from β€œjust an image” into a digital agent.

Public mint: today, 16:00
Price: 50 TON

πŸ‘‰ Mint your Lobster on Getgems 🦞🦞🦞
✨Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL

πŸ“ Summary:
Reinforcement fine-tuning enhances language model reasoning while enabling calibrated abstention and clarification for unanswerable queries through a novel reward mechanism. AI-generated summary Reinf...

πŸ”Ή Publication Date: Published on Apr 18

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.17073
β€’ PDF: https://arxiv.org/pdf/2604.17073

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
❀1
✨OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

πŸ“ Summary:
OpenMobile is an open-source framework synthesizing mobile agent training data. It uses a scalable task pipeline and policy-switching for robust trajectories. Agents trained with this data achieve superior AndroidWorld results, surpassing other open-data methods.

πŸ”Ή Publication Date: Published on Apr 16

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.15093
β€’ PDF: https://arxiv.org/pdf/2604.15093
β€’ Project Page: https://njucckevin.github.io/openmobile/
β€’ Github: https://github.com/njucckevin/OpenMobile-Code

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Streaming Structured Inference with Flash-SemiCRF

πŸ“ Summary:
Semi-Markov Conditional Random Fields are enhanced through efficient memory management techniques that enable exact inference on long sequences and large label sets by using on-the-fly computation and...

πŸ”Ή Publication Date: Published on Apr 20

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.18780
β€’ PDF: https://arxiv.org/pdf/2604.18780
β€’ Github: https://github.com/biobenkj/flash-semicrf

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Benign Fine-Tuning Breaks Safety Alignment in Audio LLMs

πŸ“ Summary:
Audio LLM safety degradation through benign fine-tuning occurs due to proximity to harmful content in embedding space, with vulnerability patterns varying by model architecture and modality. AI-genera...

πŸ”Ή Publication Date: Published on Apr 17

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/pdf/2604.16659v1
β€’ PDF: https://arxiv.org/pdf/2604.16659
β€’ Project Page: https://huggingface.co/papers?q=projector

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling

πŸ“ Summary:
COMPASS is a data-centric framework for multilingual LLM adaptation. It uses PEFT with adaptive semantic sampling to train language-specific adapters, prioritizing under-represented semantic clusters. This maximizes positive cross-lingual transfer, outperforming baselines and preventing interfere...

πŸ”Ή Publication Date: Published on Apr 22

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.20720
β€’ PDF: https://arxiv.org/pdf/2604.20720

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#MultilingualLLM #PEFT #NLP #DataCentricAI #MachineLearning
✨C-GenReg: Training-Free 3D Point Cloud Registration by Multi-View-Consistent Geometry-to-Image Generation with Probabilistic Modalities Fusion

πŸ“ Summary:
C-GenReg is a training-free 3D point cloud registration framework that uses generative priors and Vision Foundation Models to transfer matching problems to an image domain for improved cross-domain ge...

πŸ”Ή Publication Date: Published on Apr 17

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.16680
β€’ PDF: https://arxiv.org/pdf/2604.16680
β€’ Project Page: https://yuvalh9.github.io/CGenReg/
β€’ Github: https://github.com/yuvalH9/CGenReg

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts

πŸ“ Summary:
Expert upcycling expands Mixture-of-Experts capacity during continued pre-training by duplicating experts and extending routers while maintaining fixed inference cost, achieving better training effici...

πŸ”Ή Publication Date: Published on Apr 21

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.19835
β€’ PDF: https://arxiv.org/pdf/2604.19835
β€’ Github: https://github.com/amazon-science/expert-upcycling

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows

πŸ“ Summary:
Research examines how user pressure in coding agent workflows leads to score manipulation without genuine performance improvement, finding that stronger models exploit more frequently and that prompts...

πŸ”Ή Publication Date: Published on Apr 22

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.20200
β€’ PDF: https://arxiv.org/pdf/2604.20200
β€’ Project Page: https://ucsc-vlaa.github.io/AgentPressureBench
β€’ Github: https://github.com/ucsc-vlaa/AgentPressureBench

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Test-Time Adaptation for EEG Foundation Models: A Systematic Study under Real-World Distribution Shifts

πŸ“ Summary:
Test-time adaptation for EEG foundation models shows inconsistent performance across distribution shifts. Optimization-free methods are more stable and reliable, while gradient-based approaches often degrade performance. This highlights limitations and the need for domain-specific EEG adaptation ...

πŸ”Ή Publication Date: Published on Apr 18

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.16926
β€’ PDF: https://arxiv.org/pdf/2604.16926

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

πŸ“ Summary:
UniT creates a unified physical language for human-to-humanoid transfer using cross-reconstruction and shared latent spaces. This approach effectively bridges kinematic differences, enabling scalable policy learning and world modeling with human data for humanoid robots.

πŸ”Ή Publication Date: Published on Apr 21

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.19734
β€’ PDF: https://arxiv.org/pdf/2604.19734
β€’ Project Page: https://xpeng-robotics.github.io/unit/
β€’ Github: https://github.com/xpeng-robotics/UniT

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition

πŸ“ Summary:
StyleID presents a human perception-aware dataset and evaluation framework for facial identity preservation under stylization, featuring two datasets derived from psychometric experiments and calibrat...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21689
β€’ PDF: https://arxiv.org/pdf/2604.21689
β€’ Project Page: https://kwanyun.github.io/StyleID_page/
β€’ Github: https://github.com/kwanyun/StyleID

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/kwanY/styleid

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
✨Seeing Fast and Slow: Learning the Flow of Time in Videos

πŸ“ Summary:
Video speed manipulation and perception models are developed through self-supervised temporal reasoning, enabling speed detection, slow-motion video generation, and temporal super-resolution from in-t...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21931
β€’ PDF: https://arxiv.org/pdf/2604.21931
β€’ Project Page: https://seeing-fast-and-slow.github.io/

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
This media is not supported in your browser
VIEW IN TELEGRAM
✨WorldMark: A Unified Benchmark Suite for Interactive Video World Models

πŸ“ Summary:
WorldMark establishes a standardized benchmark for evaluating interactive video generation models with unified controls, identical scenarios, and comprehensive evaluation metrics across multiple model...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21686
β€’ PDF: https://arxiv.org/pdf/2604.21686
β€’ Project Page: https://alaya-studio.github.io/WorldMark/

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Context Unrolling in Omni Models

πŸ“ Summary:
Omni is a unified multimodal model trained on diverse data types that enables context unrolling for improved reasoning across heterogeneous modalities. AI-generated summary We present Omni, a unified ...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21921
β€’ PDF: https://arxiv.org/pdf/2604.21921
β€’ Project Page: https://omni-model.com/

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection

πŸ“ Summary:
A unified generative-discriminative framework is proposed that enables co-evolutionary image generation and detection through symbiotic attention mechanisms and unified fine-tuning algorithms. AI-gene...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21904
β€’ PDF: https://arxiv.org/pdf/2604.21904
β€’ Project Page: https://ivg-yanranzhang.github.io/UniGenDet/
β€’ Github: https://github.com/Zhangyr2022/UniGenDet

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation

πŸ“ Summary:
VLAA-GUI is a modular GUI agent framework that addresses early stopping and repetitive loop issues through integrated components for verification, loop breaking, and search capabilities. AI-generated ...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21375
β€’ PDF: https://arxiv.org/pdf/2604.21375
β€’ Project Page: https://ucsc-vlaa.github.io/VLAA-GUI/
β€’ Github: https://github.com/UCSC-VLAA/VLAA-GUI

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale

πŸ“ Summary:
TingIS is an enterprise-grade incident discovery system that uses multi-stage event linking with LLMs, cascaded routing, and noise reduction to efficiently identify critical issues from high-volume, n...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21889
β€’ PDF: https://arxiv.org/pdf/2604.21889

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models

πŸ“ Summary:
DAVinCI is a dual attribution and verification framework that enhances factual reliability and interpretability of large language models by attributing claims to internal components and external sourc...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21193
β€’ PDF: https://arxiv.org/pdf/2604.21193
β€’ Github: https://github.com/vr25/davinci

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
✨Explainable Disentangled Representation Learning for Generalizable Authorship Attribution in the Era of Generative AI

πŸ“ Summary:
A novel variational autoencoder framework with supervised contrastive learning and discriminative disentanglement achieves superior performance in authorship attribution and AI-generated text detectio...

πŸ”Ή Publication Date: Published on Apr 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2604.21300
β€’ PDF: https://arxiv.org/pdf/2604.21300
β€’ Github: https://github.com/hieum98/avae

==================================

For more data science resources:
βœ“ https://t.iss.one/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research