Machine Learning
39.2K subscribers
3.83K photos
32 videos
41 files
1.3K links
Machine learning insights, practical tutorials, and clear explanations for beginners and aspiring data scientists. Follow the channel for models, algorithms, coding guides, and real-world ML applications.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
๐Ÿ”ฅ Trending Repository: sim

๐Ÿ“ Description: Sim is an open-source AI agent workflow builder. Sim Studio's interface is a lightweight, intuitive way to quickly build and deploy LLMs that connect with your favorite tools.

๐Ÿ”— Repository URL: https://github.com/simstudioai/sim

๐ŸŒ Website: https://www.sim.ai

๐Ÿ“– Readme: https://github.com/simstudioai/sim#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 7.7K stars
๐Ÿ‘€ Watchers: 56
๐Ÿด Forks: 1K forks

๐Ÿ’ป Programming Languages: TypeScript - MDX - Python - CSS - Shell - Smarty

๐Ÿท๏ธ Related Topics:
#react #automation #typescript #ai #nextjs #chatbot #artificial_intelligence #gemini #openai #agents #low_code #no_code #rag #anthropic #deepseek #aiagents #agentic_workflow #agent_workflow


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
โค1
๐Ÿ”ฅ Trending Repository: firecrawl

๐Ÿ“ Description: The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data ๐Ÿ”ฅ

๐Ÿ”— Repository URL: https://github.com/firecrawl/firecrawl

๐ŸŒ Website: https://firecrawl.dev

๐Ÿ“– Readme: https://github.com/firecrawl/firecrawl#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 50.2K stars
๐Ÿ‘€ Watchers: 230
๐Ÿด Forks: 4.4K forks

๐Ÿ’ป Programming Languages: TypeScript - Python - Rust - JavaScript - Jupyter Notebook - Shell

๐Ÿท๏ธ Related Topics:
#markdown #crawler #data #scraper #ai #html_to_markdown #web_crawler #scraping #webscraping #rag #llm #ai_scraping


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
โค1
๐Ÿ”ฅ Trending Repository: sim

๐Ÿ“ Description: Sim is an open-source AI agent workflow builder. Sim's interface is a lightweight, intuitive way to rapidly build and deploy LLMs that connect with your favorite tools.

๐Ÿ”— Repository URL: https://github.com/simstudioai/sim

๐ŸŒ Website: https://www.sim.ai

๐Ÿ“– Readme: https://github.com/simstudioai/sim#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 11.6K stars
๐Ÿ‘€ Watchers: 68
๐Ÿด Forks: 1.4K forks

๐Ÿ’ป Programming Languages: TypeScript - MDX - Python - CSS - Shell - Smarty

๐Ÿท๏ธ Related Topics:
#react #automation #typescript #ai #nextjs #chatbot #artificial_intelligence #gemini #openai #agents #low_code #no_code #rag #anthropic #deepseek #aiagents #agentic_workflow #agent_workflow


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: SQLBot

๐Ÿ“ Description: ๅŸบไบŽๅคงๆจกๅž‹ๅ’Œ RAG ็š„ๆ™บ่ƒฝ้—ฎๆ•ฐ็ณป็ปŸใ€‚Text-to-SQL Generation via LLMs using RAG.

๐Ÿ”— Repository URL: https://github.com/dataease/SQLBot

๐ŸŒ Website: https://dataease.cn/sqlbot/

๐Ÿ“– Readme: https://github.com/dataease/SQLBot#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 968 stars
๐Ÿ‘€ Watchers: 13
๐Ÿด Forks: 113 forks

๐Ÿ’ป Programming Languages: Python - CSS - TypeScript - JavaScript - Shell - HTML

๐Ÿท๏ธ Related Topics:
#text_to_sql #rag #nl2sql #text2sql #llm #sqlbot #deepseek #chatbi


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: SurfSense

๐Ÿ“ Description: Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, Confluence, Notion, YouTube, GitHub, Discord and more. Join our discord:https://discord.gg/ejRNvftDp9

๐Ÿ”— Repository URL: https://github.com/MODSetter/SurfSense

๐ŸŒ Website: https://www.surfsense.net

๐Ÿ“– Readme: https://github.com/MODSetter/SurfSense#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 6.7K stars
๐Ÿ‘€ Watchers: 46
๐Ÿด Forks: 507 forks

๐Ÿ’ป Programming Languages: Python - TypeScript - MDX - CSS - JavaScript - Dockerfile

๐Ÿท๏ธ Related Topics:
#python #chrome_extension #slack #agent #jira #typescript #extension #ai #nextjs #agents #notion #perplexity #rag #fastapi #langchain #ollama #langgraph #nextjs15 #aceternity_ui #notebooklm


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: WrenAI

๐Ÿ“ Description: โšก๏ธ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered insights in seconds.

๐Ÿ”— Repository URL: https://github.com/Canner/WrenAI

๐ŸŒ Website: https://getwren.ai/oss

๐Ÿ“– Readme: https://github.com/Canner/WrenAI#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 10.1K stars
๐Ÿ‘€ Watchers: 70
๐Ÿด Forks: 1K forks

๐Ÿ’ป Programming Languages: TypeScript - Python - Go - JavaScript - Less - Dockerfile

๐Ÿท๏ธ Related Topics:
#agent #bigquery #charts #sql #postgresql #bedrock #business_intelligence #openai #spreadsheets #vertex #genbi #text_to_sql #rag #text2sql #duckdb #llm #anthropic #sqlai #text_to_chart


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿ”ฅ Trending Repository: chroma

๐Ÿ“ Description: Open-source search and retrieval database for AI applications.

๐Ÿ”— Repository URL: https://github.com/chroma-core/chroma

๐ŸŒ Website: https://www.trychroma.com/

๐Ÿ“– Readme: https://github.com/chroma-core/chroma#readme

๐Ÿ“Š Statistics:
๐ŸŒŸ Stars: 22.2K stars
๐Ÿ‘€ Watchers: 121
๐Ÿด Forks: 1.8K forks

๐Ÿ’ป Programming Languages: Rust - Python - TypeScript - Go - Jupyter Notebook - JavaScript

๐Ÿท๏ธ Related Topics:
#rust #database #ai #embeddings #rust_lang #document_retrieval #rag #vector_database #llm #llms


==================================
๐Ÿง  By: https://t.iss.one/DataScienceM
๐Ÿค–๐Ÿง  Cognee: Powerful Memory for AI Agents in Just 6 Lines of Code

๐Ÿ—“๏ธ 07 Oct 2025
๐Ÿ“š AI News & Trends

Artificial Intelligence is evolving rapidly, but one of the biggest challenges for developers is building agents that remember, reason and adapt. Traditional RAG (Retrieval-Augmented Generation) systems often fall short when handling context, scalability and precision. Thatโ€™s where Cognee comes in. It is an open-source framework designed to provide AI agents with memory using a unique ...

#AI #Memory #AIAgents #OpenSource #RAG #ArtificialIntelligence
โค3
๐Ÿ“Œ How to Evaluate Retrieval Quality in RAG Pipelines (part 2): Mean Reciprocal Rank (MRR) and Average Precision (AP)

๐Ÿ—‚ Category: LARGE LANGUAGE MODELS

๐Ÿ•’ Date: 2025-11-05 | โฑ๏ธ Read time: 9 min read

Enhance your RAG pipeline's performance by effectively evaluating its retrieval quality. This guide, the second in a series, explores the use of key binary, order-aware metrics. It provides a detailed look at Mean Reciprocal Rank (MRR) and Average Precision (AP), essential tools for ensuring your system retrieves the most relevant information first and improves overall accuracy.

#RAG #LLM #AIEvaluation #MachineLearning
๐Ÿ“Œ Multi-Agent SQL Assistant, Part 2: Building a RAG Manager

๐Ÿ—‚ Category: AI APPLICATIONS

๐Ÿ•’ Date: 2025-11-06 | โฑ๏ธ Read time: 21 min read

Explore building a multi-agent SQL assistant in this hands-on guide to creating a RAG Manager. Part 2 of this series provides a practical comparison of multiple Retrieval-Augmented Generation strategies, weighing traditional keyword search against modern vector-based approaches using FAISS and Chroma. Learn how to select and implement the most effective retrieval method to enhance your AI assistant's performance and accuracy when interacting with databases.

#RAG #SQL #AI #VectorSearch #LLM
โค1
๐Ÿ“Œ Do You Really Need GraphRAG? A Practitionerโ€™s Guide Beyond the Hype

๐Ÿ—‚ Category: LARGE LANGUAGE MODELS

๐Ÿ•’ Date: 2025-11-11 | โฑ๏ธ Read time: 15 min read

Go beyond the hype with this practitioner's guide to GraphRAG. This article offers a critical perspective on the advanced RAG technique, exploring essential design best practices, common challenges, and key learnings from real-world implementation. It provides a framework to help you decide if GraphRAG is the right solution for your specific needs, moving past the buzz to focus on practical application.

#GraphRAG #RAG #AI #KnowledgeGraphs #LLM
๐Ÿ“Œ How to Evaluate Retrieval Quality in RAG Pipelines (Part 3): DCG@k and NDCG@k

๐Ÿ—‚ Category: LARGE LANGUAGE MODELS

๐Ÿ•’ Date: 2025-11-12 | โฑ๏ธ Read time: 8 min read

This final part of the series on RAG pipeline evaluation explores advanced metrics for assessing retrieval quality. Learn how to use Discounted Cumulative Gain (DCG@k) and Normalized Discounted Cumulative Gain (NDCG@k) to measure the relevance and ranking of retrieved documents, moving beyond simpler metrics for a more nuanced understanding of your system's performance.

#RAG #EvaluationMetrics #LLM #InformationRetrieval #MLOps
โค5
๐Ÿ“Œ How to Build an Over-Engineered Retrieval System

๐Ÿ—‚ Category: LARGE LANGUAGE MODELS

๐Ÿ•’ Date: 2025-11-18 | โฑ๏ธ Read time: 53 min read

This article breaks down the process of building a deliberately complex, or 'over-engineered,' retrieval system. It offers a practical look at advanced architectures and methods that, despite their complexity, are used in real-world scenarios for powerful information retrieval and RAG applications. It's an exploration of intricate designs that are surprisingly common in practice.

#RAG #SystemDesign #SoftwareArchitecture #InformationRetrieval
โค3
๐Ÿ“Œ Introducing Googleโ€™s File Search Tool

๐Ÿ—‚ Category: AI APPLICATIONS

๐Ÿ•’ Date: 2025-11-18 | โฑ๏ธ Read time: 12 min read

Google has introduced its new File Search Tool, a direct challenge to traditional Retrieval-Augmented Generation (RAG) processing. This latest move by the search giant signals a significant development in AI-powered information retrieval, aiming to offer a more advanced alternative to conventional methods for searching and processing files.

#Google #AI #RAG #FileSearch
โค3
๐Ÿ“Œ How to Perform Agentic Information Retrieval

๐Ÿ—‚ Category: AGENTIC AI

๐Ÿ•’ Date: 2025-11-19 | โฑ๏ธ Read time: 9 min read

Leverage the power of autonomous AI agents for advanced information retrieval. This guide explores Agentic Information Retrieval, a method for deploying intelligent agents to proactively search, analyze, and extract precise information from your document corpus. Go beyond traditional keyword search and streamline complex data discovery with this cutting-edge technique.

#AIagents #InformationRetrieval #AgenticAI #RAG
โค3
๐Ÿ“Œ The Architecture Behind Web Search in AI Chatbots

๐Ÿ—‚ Category: LLM APPLICATIONS

๐Ÿ•’ Date: 2025-12-04 | โฑ๏ธ Read time: 16 min read

Explore the technical architecture powering web search in AI chatbots. This analysis breaks down how generative models retrieve and integrate live web data to provide current answers, highlighting the crucial shift towards Generative Engine Optimization (GEO). Learn what this new paradigm means for content visibility in an AI-first search landscape, moving beyond traditional SEO.

#AI #GEO #Chatbots #Search #RAG
โค2
๐Ÿค–๐Ÿง  LEANN: The Bright Future of Lightweight, Private, and Scalable Vector Databases

๐Ÿ—“๏ธ 24 Nov 2025
๐Ÿ“š AI News & Trends

In the rapidly expanding world of artificial intelligence, data storage and retrieval efficiency have become major bottlenecks for scalable AI systems. The growth of Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs) has further intensified the demand for fast, private and space-efficient vector databases. Traditional systems like FAISS or Milvus while powerful, are resource-heavy and ...

#LEANN #LightweightVectorDatabases #PrivateAI #ScalableAI #RAG #AIDataStorage
โค1