Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
🖥https://github.com/openbmb/ioa
  
  An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity. - OpenBMB/IoA
  FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs
🖥https://github.com/funaudiollm/cosyvoice
  
  Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. - FunAudioLLM/CosyVoice
  Cradle: Empowering Foundation Agents Towards General Computer Control
🖥https://github.com/baai-agents/cradle
  
  The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curatio...
  Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
🖥https://github.com/stanford-oval/storm
  
  An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. - stanford-oval/storm
  GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
🖥https://github.com/chenyirui/gim
  
  This repository is the official repository of the GIM. - chenyirui/GIM
  Fundus: A Simple-to-Use News Scraper Optimized for High Quality Extractions
🖥https://github.com/flairnlp/fundus
  
  A very simple news crawler with a funny name. - flairNLP/fundus
  VGGSfM: Visual Geometry Grounded Deep Structure From Motion
🖥https://github.com/facebookresearch/vggsfm
  
  VGGSfM: Visual Geometry Grounded Deep Structure From Motion - facebookresearch/vggsfm
  DataComp-LM: In search of the next generation of training sets for language models
🖥https://github.com/mlfoundations/dclm
  
  DataComp for Language Models. - mlfoundations/dclm
  OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
🖥https://github.com/wanghao9610/ov-dino
  
  Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion - wanghao9610/OV-DINO
  "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
🖥https://github.com/verazuo/jailbreak_llms
  
  [CCS'24] A dataset of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts). - verazuo/jailbreak_llms
  LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
🖥https://github.com/mcgill-nlp/llm2vec
  
  Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders' - McGill-NLP/llm2vec
  Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors
🖥https://github.com/shangwei5/st-avsr
  
  Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors (ECCV2024) - shangwei5/ST-AVSR
  Neural General Circulation Models for Weather and Climate
🖥https://github.com/google-research/neuralgcm
  
  Hybrid ML + physics model of the Earth's atmosphere - neuralgcm/neuralgcm
  CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
🖥https://github.com/thudm/cogvideo
  
  text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) - THUDM/CogVideo
  Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers
🖥https://github.com/liruiw/HPT
  
  Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner. - liruiw/HPT
  TensorIR: An Abstraction for Automatic Tensorized Program Optimization
🖥https://github.com/mlc-ai/web-llm
  
  High-performance In-browser LLM Inference Engine. - mlc-ai/web-llm
  