Data Science | Machine Learning with Python for Researchers

🤖🧠 Wan 2.1: Alibaba’s Open-Source Revolution in Video Generation

🗓️ 21 Oct 2025
📚 AI News & Trends

The landscape of artificial intelligence has been evolving rapidly, especially in the domain of video generation. Since OpenAI unveiled Sora in 2024, the world has witnessed an explosive surge in research and innovation within generative AI. However, most of these cutting-edge tools remained closed-source limiting transparency and accessibility. Recognizing this gap, Alibaba Group introduced Wan, ...

#Alibaba #Wan2.1 #VideoGeneration #GenerativeAI #OpenSource #ArtificialIntelligence

❤1

315 views11:47

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

🤖🧠 Master Machine Learning: Explore the Ultimate “Machine-Learning-Tutorials” Repository

🗓️ 23 Oct 2025
📚 AI News & Trends

In today’s data-driven world, Machine Learning (ML) has become the cornerstone of modern technology from intelligent chatbots to predictive analytics and recommendation systems. However, mastering ML isn’t just about coding, it requires a structured understanding of algorithms, statistics, optimization techniques and real-world problem-solving. That’s where Ujjwal Karn’s Machine-Learning-Tutorials GitHub repository stands out. This open-source, topic-wise ...

#MachineLearning #MLTutorials #ArtificialIntelligence #DataScience #OpenSource #AIEducation

353 views14:56

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

🤖🧠 LangChain: The Ultimate Framework for Building Reliable AI Agents and LLM Applications

🗓️ 24 Oct 2025
📚 AI News & Trends

As artificial intelligence continues to transform industries, developers are racing to build smarter, more adaptive applications powered by Large Language Models (LLMs). Yet, one major challenge remains how to make these models interact intelligently with real-world data and external systems in a scalable, reliable way. Enter LangChain, an open-source framework designed to make LLM-powered application ...

#LangChain #AI #LLM #ArtificialIntelligence #OpenSource #AIAgents

❤1

343 views15:56

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

🤖🧠 Microsoft Data Formulator: Revolutionizing AI-Powered Data Visualization

🗓️ 28 Oct 2025
📚 AI News & Trends

In today’s data-driven world, visualization is everything. Whether you’re a business analyst, data scientist or researcher, the ability to convert raw data into meaningful visuals can define the success of your decisions. That’s where Microsoft’s Data Formulator steps in a cutting-edge, open-source platform designed to empower analysts to create rich, AI-assisted visualizations effortlessly. Developed by ...

#Microsoft #DataVisualization #AI #DataScience #OpenSource #Analytics

192 views23:16

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

⚡️

OpenAI presents gpt-oss-safeguard - open models for flexible and transparent content moderation

Instead of a rigidly trained classifier, the model takes your own security policy as input and reasons whether the message complies with this policy.

The result is not just "safe/unsafe," but a chain of reasoning that you can verify and improve.

The models are available in two sizes: 120B and 20B.
• gpt-oss-safeguard-120B
• gpt-oss-safeguard-20B

💡 Why they are needed:
• Policies can be changed without retraining the model
• Suitable for niche or rapidly changing risks (e.g., cheating in games or fake reviews)
• Does not require thousands of labeled examples
• Ideal when explainability is important rather than minimal latency

Both are available under the Apache 2.0 license - they can be freely used, modified, and deployed.

🔗 Official announcement
🤗 H ugging Face

#openai #chatgpt #opensource

Please open Telegram to view this post

VIEW IN TELEGRAM

Please open Telegram to view this post

VIEW IN TELEGRAM

257 views13:37

Data Science | Machine Learning with Python for Researchers

🤖🧠 Reflex: Build Full-Stack Web Apps in Pure Python — Fast, Flexible and Powerful

🗓️ 29 Oct 2025
📚 AI News & Trends

Building modern web applications has traditionally required mastering multiple languages and frameworks from JavaScript for the frontend to Python, Java or Node.js for the backend. For many developers, switching between different technologies can slow down productivity and increase complexity. Reflex eliminates that problem. It is an innovative open-source full-stack web framework that allows developers to ...

#Reflex #FullStack #WebDevelopment #Python #OpenSource #WebApps

289 views19:14

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

🤖🧠 MiniMax-M2: The Open-Source Revolution Powering Coding and Agentic Intelligence

🗓️ 30 Oct 2025
📚 AI News & Trends

Artificial intelligence is evolving faster than ever, but not every innovation needs to be enormous to make an impact. MiniMax-M2, the latest release from MiniMax-AI, demonstrates that efficiency and power can coexist within a streamlined framework. MiniMax-M2 is an open-source Mixture of Experts (MoE) model designed for coding tasks, multi-agent collaboration and automation workflows. With ...

#MiniMaxM2 #OpenSource #MachineLearning #CodingAI #AgenticIntelligence #MixtureOfExperts

263 views21:15

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

🤖🧠 LongCat-Video: Meituan’s Groundbreaking Step Toward Efficient Long Video Generation with AI

🗓️ 04 Nov 2025
📚 AI News & Trends

In the rapidly advancing field of generative AI, the ability to create realistic, coherent, and high-quality videos from text or images has become one of the most sought-after goals. Meituan, one of the leading technology innovators in China, has made a remarkable stride in this domain with its latest open-source model — LongCat-Video. Designed as ...

#LongCatVideo #Meituan #GenerativeAI #VideoGeneration #AIInnovation #OpenSource

219 views22:21

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

✨olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

📝 Summary:
olmOCR is an open-source toolkit that uses a fine-tuned vision language model to convert PDFs into clean, structured text. It enables large-scale, cost-effective extraction of trillions of tokens for training language models.

🔹 Publication Date: Published on Feb 25

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.18443
• PDF: https://arxiv.org/pdf/2502.18443
• Github: https://github.com/allenai/olmocr

✨ Datasets citing this paper:
• https://huggingface.co/datasets/davanstrien/test-olmocr2
• https://huggingface.co/datasets/davanstrien/newspapers-olmocr2
• https://huggingface.co/datasets/stckmn/ocr-output-Directive017-1761355297

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#OCR #VLMs #LLM #DataExtraction #OpenSource

63 views05:56

✨ Explore Data Science 📝 Write your paper

Data Science | Machine Learning with Python for Researchers

✨MinerU: An Open-Source Solution for Precise Document Content Extraction

📝 Summary:
MinerU is an open-source tool that provides high-precision document content extraction. It uses fine-tuned models and pre/postprocessing rules to consistently achieve high performance across diverse document types.

🔹 Publication Date: Published on Sep 27, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/pdf/2409.18839
• PDF: https://huggingface.co/spaces/Echo9k/PDF_reader
• Github: https://github.com/opendatalab/MinerU

✨ Spaces citing this paper:
• https://huggingface.co/spaces/opendatalab/MinerU
• https://huggingface.co/spaces/xiaoye-winters/MinerU-API
• https://huggingface.co/spaces/ApeAITW/MinerU_2.5_Test

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#DocumentExtraction #OpenSource #DataScience #NLP #AI

52 views05:58

✨ Explore Data Science 📝 Write your paper

Data Science | Machine Learning with Python for Researchers

🤖🧠 Krea Realtime 14B: Redefining Real-Time Video Generation with AI

🗓️ 05 Nov 2025
📚 AI News & Trends

The field of artificial intelligence is undergoing a remarkable transformation and one of the most exciting developments is the rise of real-time video generation. From cinematic visual effects to immersive virtual environments, AI is rapidly blurring the boundaries between imagination and reality. At the forefront of this innovation stands Krea Realtime 14B, an advanced open-source ...

#AI #RealTimeVideo #ArtificialIntelligence #OpenSource #VideoGeneration #KreaRealtime14B

255 views22:30

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

🤖🧠 FIBO: The First JSON-Native, Open-Source Text-to-Image Model Built for Real-World Control and Accuracy

🗓️ 07 Nov 2025
📚 AI News & Trends

The world of generative AI has evolved rapidly with text-to-image tools enabling creators, marketers, designers and enterprises to bring ideas to life with unprecedented ease. However, most existing models have a clear limitation: they prioritize imagination at the cost of control. Whether producing inconsistent styles, unpredictable lighting or drifting away from user prompts, traditional models ...

#FIBO #TextToImage #GenerativeAI #OpenSource #JSONNative #RealWorldControl

499 views01:33

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

✨OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

📝 Summary:
OmniVinci is an open-source omni-modal LLM that improves cross-modal understanding for audio, vision, and robotics. It features innovative architecture for better embedding alignment and temporal capture, along with efficient data curation. OmniVinci outperforms competitors while using significan...

🔹 Publication Date: Published on Oct 17

🔹 Paper Links:
• arXiv Page: https://arxivexplained.com/papers/omnivinci-enhancing-architecture-and-data-for-omni-modal-understanding-llm
• PDF: https://arxiv.org/pdf/2510.15870
• Project Page: https://nvlabs.github.io/OmniVinci/
• Github: https://github.com/NVlabs/OmniVinci

🔹 Models citing this paper:
• https://huggingface.co/nvidia/omnivinci

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLM #MultimodalAI #Robotics #DeepLearning #OpenSource

512 views09:00

✨ Explore Data Science 📝 Write your paper

Data Science | Machine Learning with Python for Researchers

🤖🧠 Meilisearch: The Lightning-Fast, AI-Ready Search Engine for Modern Applications

🗓️ 08 Nov 2025
📚 AI News & Trends

Search is no longer a luxury feature. Today’s users expect instant, relevant results across e-commerce platforms, SaaS tools, media libraries and knowledge systems. With AI-powered experiences becoming the new standard, developers need search infrastructure that is fast, flexible, developer-friendly and ready for hybrid semantic search. This is where Meilisearch stands out. Meilisearch is an open-source, ...

#Meilisearch #AIReadySearch #LightningFast #SearchEngine #ModernApplications #OpenSource

280 views21:33

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

0:00

Media is too big

VIEW IN TELEGRAM

✨UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

📝 Summary:
UniVA is an open-source multi-agent framework that unifies video understanding, segmentation, editing, and generation. It uses a Plan-and-Act architecture with hierarchical memory to enable complex, iterative video workflows. This system aims to advance agentic video intelligence.

🔹 Publication Date: Published on Nov 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.08521
• PDF: https://arxiv.org/pdf/2511.08521
• Project Page: https://univa.online/
• Github: https://github.com/univa-agent/univa

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#VideoAI #AIagents #GenerativeAI #ComputerVision #OpenSource

252 views11:02

✨ Explore Data Science 📝 Write your paper

Data Science | Machine Learning with Python for Researchers

🤖🧠 Steel Browser: The Open-Source Browser API Powering AI Agents and Automation

🗓️ 16 Nov 2025
📚 AI News & Trends

The evolution of artificial intelligence has ushered in a new era of automation where AI agents can perform complex digital tasks with minimal human intervention. However, one of the biggest challenges for developers building these systems is browser automation managing sessions, proxies, cookies and debugging environments. This is where Steel Browser comes into play. Steel ...

#SteelBrowser #OpenSource #BrowserAutomation #AIAgents #WebScraping #DigitalAutomation

👍1

376 views06:44

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

🤖🧠 Skyvern: The Future of Browser Automation Powered by AI and Computer Vision

🗓️ 16 Nov 2025
📚 AI News & Trends

In today’s fast-evolving digital landscape, automation plays a crucial role in enhancing productivity, efficiency and innovation. Yet, traditional browser automation tools often struggle with complexity, maintenance and reliability. They rely heavily on DOM parsing, XPaths and rigid scripts that easily break when websites change their layout. Enter Skyvern, an open-source, AI-driven browser automation platform developed ...

#Skyvern #BrowserAutomation #AIDriven #ComputerVision #OpenSource #WebAutomation

❤‍🔥1❤1👍1

393 views13:44

📖 Read More

📣 BEST TELEGRAM CHANNELS

Data Science | Machine Learning with Python for Researchers

✨P1: Mastering Physics Olympiads with Reinforcement Learning

📝 Summary:
P1 is a family of open-source physics reasoning models trained via reinforcement learning. P1-235B-A22B achieved Gold-medal performance at IPhO 2025 and won 12 other competitions. These models also show strong generalizability on other reasoning tasks.

🔹 Publication Date: Published on Nov 17

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.13612
• PDF: https://arxiv.org/pdf/2511.13612
• Project Page: https://prime-rl.github.io/P1/
• Github: https://github.com/PRIME-RL/P1

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#ReinforcementLearning #Physics #AI #MachineLearning #OpenSource

204 views06:06

✨ Explore Data Science 📝 Write your paper

Data Science | Machine Learning with Python for Researchers

✨Instella: Fully Open Language Models with Stellar Performance

📝 Summary:
Instella is a family of fully open language models trained on open data. It achieves state-of-the-art among fully open models and competes with leading open-weight LLMs. Specialized variants for long context and math reasoning are also offered.

🔹 Publication Date: Published on Nov 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.10628
• PDF: https://arxiv.org/pdf/2511.10628
• Github: https://github.com/AMD-AGI/Instella

🔹 Models citing this paper:
• https://huggingface.co/amd/AMD-OLMo
• https://huggingface.co/amd/Instella-3B-Instruct
• https://huggingface.co/amd/Instella-3B

✨ Datasets citing this paper:
• https://huggingface.co/datasets/amd/Instella-Long
• https://huggingface.co/datasets/amd/Instella-GSM8K-synthetic

✨ Spaces citing this paper:
• https://huggingface.co/spaces/DexterSptizu/AMD-OLMo-1B
• https://huggingface.co/spaces/universeofml/DeepFocusTrain

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#LLMs #OpenSource #AI #MachineLearning #NLP

arXiv.org

Instella: Fully Open Language Models with Stellar Performance

Large language models (LLMs) have demonstrated remarkable performance across a wide range of tasks, yet the majority of high-performing models remain closed-source or partially open, limiting...

❤1

281 views11:08

✨ Explore Data Science 📝 Write your paper

Data Science | Machine Learning with Python for Researchers

✨OpenUS: A Fully Open-Source Foundation Model for Ultrasound Image Analysis via Self-Adaptive Masked Contrastive Learning

📝 Summary:
OpenUS is an open-source ultrasound foundation model built on a large public dataset. It uses a vision Mamba backbone and a novel self-adaptive masking framework to enhance pre-training, enabling label-efficient fine-tuning for various US tasks.

🔹 Publication Date: Published on Nov 14

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.11510
• PDF: https://arxiv.org/pdf/2511.11510
• Github: https://github.com/XZheng0427/OpenUS

==================================

For more data science resources:
✓ https://t.iss.one/DataScienceT

#OpenSource #FoundationModel #UltrasoundAI #MachineLearning #MedicalImaging

❤1

146 views22:10

✨ Explore Data Science 📝 Write your paper

About

Blog

Apps

Platform