YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
  
  aishwaryanr/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
#awesome #awesome_list #generative_ai #interview_questions #large_language_models #llms #notebook_jupyter #vision_and_language
Stars: 332 Issues: 0 Forks: 57
https://github.com/aishwaryanr/awesome-generative-ai-guide
  
  BradyFU/Video-MME
✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Language: Python
#large_language_models #large_vision_language_models #mme #multimodal_large_language_models #video #video_mme
Stars: 182 Issues: 1 Forks: 6
https://github.com/BradyFU/Video-MME
  
  zou-group/textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.
Language: Python
#compound_systems #large_language_models #prompt_optimization #textual_gradients
Stars: 267 Issues: 2 Forks: 16
https://github.com/zou-group/textgrad
  
  QwenLM/Qwen2-Math
A series of math-specific large language models of our Qwen2 series.
Language: Python
#large_language_models #mathematics #qwen2
Stars: 252 Issues: 1 Forks: 13
https://github.com/QwenLM/Qwen2-Math
  
  ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language: Python
#large_language_models #multimodal_large_language_models #speech_interaction #speech_language_model #speech_to_speech #speech_to_text
Stars: 274 Issues: 1 Forks: 16
https://github.com/ictnlp/LLaMA-Omni
  
  ictnlp/LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Language: Python
#efficient #gpt4o #gpt4v #large_language_models #large_multimodal_models #llama #llava #multimodal #multimodal_large_language_models #video #vision #vision_language_model #visual_instruction_tuning
Stars: 173 Issues: 7 Forks: 11
https://github.com/ictnlp/LLaVA-Mini
  
  HKUDS/MiniRAG
"MiniRAG: Making RAG Simpler with Small and Free Language Models"
Language: Python
#large_language_models #rag #retrieval_augmented_generation
Stars: 221 Issues: 1 Forks: 27
https://github.com/HKUDS/MiniRAG
  
  HKUDS/VideoRAG
"VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"
Language: Python
#large_language_models #llms #long_video_understanding #multi_modal_llms #rag #retrieval_augmented_generation
Stars: 201 Issues: 1 Forks: 14
https://github.com/HKUDS/VideoRAG
  
  cxcscmu/Crawl4LLM
Official repository for "Crawl4LLM: Efficient Web Crawling for LLM Pretraining"
Language: Python
#crawler #crawling #large_language_models #llm #pre_training #pretraining #web_crawler #web_crawling
Stars: 359 Issues: 0 Forks: 25
https://github.com/cxcscmu/Crawl4LLM
  
  KalyanKS-NLP/rag-zero-to-hero-guide
Comprehensive guide to learn RAG from basics to advanced.
Language: Jupyter Notebook
#ai_engineer #generative_ai #large_language_models #llm_engineer #llm_rag #llms #retrieval_augmented_generation
Stars: 344 Issues: 0 Forks: 89
https://github.com/KalyanKS-NLP/rag-zero-to-hero-guide
  
  QwenLM/ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
Language: Python
#large_language_models #llm #machine_learning #scaling_law
Stars: 222 Issues: 1 Forks: 9
https://github.com/QwenLM/ParScale
  
  MiniMax-AI/MiniMax-M1
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
Language: Python
#large_language_models #llm #minimax_m1 #reasoning_models
Stars: 328 Issues: 3 Forks: 9
https://github.com/MiniMax-AI/MiniMax-M1
  
  NVlabs/Long-RL
Long-RL: Scaling RL to Long Sequences
Language: Python
#efficient_ai #large_language_models #long_sequence #multi_modality #reinforcement_learning #sequence_parallelism
Stars: 301 Issues: 2 Forks: 3
https://github.com/NVlabs/Long-RL
  
  Relaxed-System-Lab/Flash-Sparse-Attention
🚀🚀 Efficient implementations of Native Sparse Attention
Language: Python
#kernels #large_language_models #machine_learning_systems
Stars: 237 Issues: 1 Forks: 3
https://github.com/Relaxed-System-Lab/Flash-Sparse-Attention
  
  pengzhangzhi/Open-dLLM
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
Language: Python
#diffusion_models #large_language_models
Stars: 159 Issues: 3 Forks: 5
https://github.com/pengzhangzhi/Open-dLLM
  
  