Github LLMs – Telegram

Github LLMs

750 subscribers

39 photos

3 videos

4 files

53 links

LLM projects
@Raminmousa

Download Telegram

About

Blog

Apps

Platform

750 subscribers

Channel created

14:57

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Creator: Microsoft
Stars ⭐️: 13.7k
Forked By: 1.2k
GitHub Repo:
https://github.com/microsoft/graphrag

➖➖➖➖➖➖➖➖➖➖➖➖➖➖
Join @deep_learning_proj

GitHub - microsoft/graphrag: A modular graph-based Retrieval-Augmented Generation (RAG) system

A modular graph-based Retrieval-Augmented Generation (RAG) system - microsoft/graphrag

👍1

472 views17:06

firecrawl

Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Creator: Mendable
Stars ⭐️: 12.3k
Forked By: 861
GitHub Repo:
https://github.com/mendableai/firecrawl

✅

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

GitHub - mendableai/firecrawl: 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with…

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. - mendableai/firecrawl

👍2

565 views08:42

🖥

Awesome LLM Strawberry (OpenAI o1)

▪ Github

✅

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

👍2

4.06K viewsedited 19:35

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Creator: OpenBMB
Stars ⭐️: 11.4k
Forked By: 798
GitHub Repo:
https://github.com/OpenBMB/MiniCPM-V

➖➖➖➖➖➖➖➖➖➖➖➖➖➖
Join ✅https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

GitHub - OpenBMB/MiniCPM-o: MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone - OpenBMB/MiniCPM-o

576 views11:22

LLM based Multi-Agent methods

🖥

Github: https://github.com/AgnostiqHQ/multi-agent-llm

📕

Paper: https://arxiv.org/abs/2409.12618v1

🤗 Dataset: https://paperswithcode.com/dataset/hotpotqa

✅

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

GitHub - AgnostiqHQ/multi-agent-llm: Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)

Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) - AgnostiqHQ/multi-agent-llm

4.56K views17:38

🌟 GRIN MoE: Mixture-of-Experts от Microsoft.

🟢total parameters: 16x3.8B;
🟢active parameters: 6.6B;
🟢context length: 4096;
🟢number of embeddings 4096;
🟢number of layers: 32;

✅

https://t.iss.one/deep_learning_proj

🟡

🟡

🖥

Please open Telegram to view this post

VIEW IN TELEGRAM

3.06K views15:11

llama-stack

Model components of the Llama Stack APIs

Creator: Meta Llama
Stars ⭐️: 1.5k
Forked By: 137
https://github.com/meta-llama/llama-stack

✅

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

GitHub - meta-llama/llama-stack: Composable building blocks to build Llama Apps

Composable building blocks to build Llama Apps. Contribute to meta-llama/llama-stack development by creating an account on GitHub.

707 views12:42

Crawl 4 AI

Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper

Creator: UncleCode
Stars ⭐️: 8.6k
Forked By: 627
https://github.com/unclecode/crawl4ai

✅

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

GitHub - unclecode/crawl4ai: 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://dis…

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN - unclecode/crawl4ai

4.29K views09:19

🔥 NVIDIA silently release a Llama 3.1 70B fine-tune that outperforms
GPT-4o and Claude Sonnet 3.5

Llama 3.1 Nemotron 70B Instruct a further RLHFed model on
huggingface

https://huggingface.co/collections/nvidia/llama-31-nemotron-70b-670e93cd366feea16abc13d8

✅

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

4.15K viewsedited 20:09

🌟 Zamba2-Instruct

В семействе 2 модели:

🟢

Zamba2-1.2B-instruct;

🟠

Zamba2-2.7B-instruct.

# Clone repo
git clone https://github.com/Zyphra/transformers_zamba2.git
cd transformers_zamba2

# Install the repository & accelerate:
pip install -e .
pip install accelerate

# Inference:
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

tokenizer = AutoTokenizer.from_pretrained("Zyphra/Zamba2-2.7B-instruct")
model = AutoModelForCausalLM.from_pretrained("Zyphra/Zamba2-2.7B-instruct", device_map="cuda", torch_dtype=torch.bfloat16)

user_turn_1 = "user_prompt1."
assistant_turn_1 = "assistant_prompt."
user_turn_2 = "user_prompt2."
sample = [{'role': 'user', 'content': user_turn_1}, {'role': 'assistant', 'content': assistant_turn_1}, {'role': 'user', 'content': user_turn_2}]
chat_sample = tokenizer.apply_chat_template(sample, tokenize=False)

input_ids = tokenizer(chat_sample, return_tensors='pt', add_special_tokens=False).to("cuda")
outputs = model.generate(**input_ids, max_new_tokens=150, return_dict_in_generate=False, output_scores=False, use_cache=True, num_beams=1, do_sample=False)
print((tokenizer.decode(outputs[0])))

🖥

GitHub

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

👍2

3.51K viewsedited 19:16

📖

LLM-Agent-Paper-List is a repository of papers on the topic of agents based on large language models (LLM)! The papers are divided into categories such as LLM agent architectures, autonomous LLM agents, reinforcement learning (RL), natural language processing methods, multimodal approaches and tools for developing LLM agents, and more.

🖥

Github

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

👍3

4.06K viewsedited 04:36

https://github.com/andrewyng/aisuite
#LLMs

https://t.iss.one/deep_learning_proj

2.99K viewsedited 18:51

LLM-based agents for Software Engineering
"Large Language Model-Based Agents for Software Engineering: A Survey".

https://github.com/FudanSELab/Agent4SE-Paper-List.

https://t.iss.one/deep_learning_proj

3.06K viewsedited 19:26

Welcome to Ollama's Prompt Engineering Interactive Tutorial

🔗 Github

https://t.iss.one/deep_learning_proj

👍3

3.33K viewsedited 14:24

Forwarded from Machine learning books and papers

⚡️ MobileLLM

🟢

MobileLLM-125M. 30 Layers, 9 Attention Heads, 3 KV Heads. 576 Token Dimension;

🟢

MobileLLM-350M. 32 Layers, 15 Attention Heads, 5 KV Heads. 960 Token Dimension;

🟢

MobileLLM-600M. 40 Layers, 18 Attention Heads, 6 KV Heads. 1152 Token Dimension;

🟢

MobileLLM-1B. 54 Layers, 20 Attention Heads, 5 KV Heads. 1280 Token Dimension;

🟡

🖥

GitHub

@Machine_learn

Please open Telegram to view this post

VIEW IN TELEGRAM

930 views03:00

Fine_Tuning_LLMs_with_Hugging_Face_Partial_Code.ipynb

Fine Tuning LLMs with Hugging Face LLMs Code

https://t.iss.one/deep_learning_proj

3.46K viewsedited 03:04

Forwarded from Machine learning books and papers

🌟 BioNeMo: A Framework for Developing AI Models for Drug Design.

NVIDIA BioNeMo2 Framework is a set of tools, libraries, and models for computational drug discovery and design.

▶️ Pre-trained models:

🟢

ESM-2 is a pre-trained bidirectional encoder (BERT-like) for amino acid sequences. BioNeMo2 includes checkpoints with parameters 650M and 3B;

🟢

Geneformer is a tabular scoring model that generates a dense representation of a cell's scRNA by examining co-expression patterns in individual cells.

▶️ Datasets:

🟠

CELLxGENE is a collection of publicly available single-cell datasets collected by the CZI (Chan Zuckerberg Initiative) with a total volume of 24 million cells;

🟠

UniProt is a database of clustered sets of protein sequences from UniProtKB, created on the basis of translated genomic data.

🟡

🟡

🖥

GitHub

@Machine_learn

Please open Telegram to view this post

VIEW IN TELEGRAM

👍2

913 views15:15

🌟 LLaMA-Mesh:

🟡

🖥

GitHub

https://t.iss.one/deep_learning_proj

Please open Telegram to view this post

VIEW IN TELEGRAM

👍1

4.14K viewsedited 20:40