SegGPT: Segmenting Everything In Context
📝We unify various segmentation tasks into a generalist in-context learning framework that accommodates different kinds of segmentation data by transforming them into the same format of images.https://github.com/baaivision/painter
📝We unify various segmentation tasks into a generalist in-context learning framework that accommodates different kinds of segmentation data by transforming them into the same format of images.https://github.com/baaivision/painter
GitHub
GitHub - baaivision/Painter: Painter & SegGPT Series: Vision Foundation Models from BAAI
Painter & SegGPT Series: Vision Foundation Models from BAAI - GitHub - baaivision/Painter: Painter & SegGPT Series: Vision Foundation Models from BAAI
Instruction Tuning with GPT-4
📝Prior work has shown that finetuning large language models (LLMs) using machine-generated instruction-following data enables such models to achieve remarkable zero-shot capabilities on new tasks, and no human-written instructions are needed.https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM
📝Prior work has shown that finetuning large language models (LLMs) using machine-generated instruction-following data enables such models to achieve remarkable zero-shot capabilities on new tasks, and no human-written instructions are needed.https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM
GitHub
GitHub - Instruction-Tuning-with-GPT-4/GPT-4-LLM: Instruction Tuning with GPT-4
Instruction Tuning with GPT-4. Contribute to Instruction-Tuning-with-GPT-4/GPT-4-LLM development by creating an account on GitHub.
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
📝Solving complicated AI tasks with different domains and modalities is a key step toward advanced artificial intelligence.https://github.com/microsoft/JARVIS
📝Solving complicated AI tasks with different domains and modalities is a key step toward advanced artificial intelligence.https://github.com/microsoft/JARVIS
GitHub
GitHub - microsoft/JARVIS: JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf - microsoft/JARVIS
Segment Anything
📝We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation.https://github.com/facebookresearch/segment-anything
📝We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation.https://github.com/facebookresearch/segment-anything
GitHub
GitHub - facebookresearch/segment-anything: The repository provides code for running inference with the SegmentAnything Model (SAM)…
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. -...
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
📝In this paper, we address this challenge, and propose GPTQ, a new one-shot weight quantization method based on approximate second-order information, that is both highly-accurate and highly-efficient.https://github.com/thudm/chatglm-6b
📝In this paper, we address this challenge, and propose GPTQ, a new one-shot weight quantization method based on approximate second-order information, that is both highly-accurate and highly-efficient.https://github.com/thudm/chatglm-6b
GitHub
GitHub - THUDM/ChatGLM-6B: ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型 - GitHub - THUDM/ChatGLM-6B: ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation📝We present SadTalker, which generates 3D motion coefficients (head pose, expression) of the 3DMM from audio and implicitly modulates a novel 3D-aware face render for talking head generation.https://github.com/winfredy/sadtalker
Segment Everything Everywhere All at Once
📝https://github.com/ux-decoder/segment-everything-everywhere-all-at-once
Segment Everything Everywhere All at Once
📝https://github.com/ux-decoder/segment-everything-everywhere-all-at-once
GitHub
GitHub - OpenTalker/SadTalker: [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single…
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation - OpenTalker/SadTalker
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
📝https://github.com/thudm/imagereward
📝https://github.com/thudm/imagereward
GitHub
GitHub - THUDM/ImageReward: ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation - GitHub - THUDM/ImageReward: ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Self-Instruct: Aligning Language Model with Self Generated Instructions
📝https://github.com/databrickslabs/dolly
📝https://github.com/databrickslabs/dolly
GitHub
GitHub - databrickslabs/dolly: Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform - databrickslabs/dolly
CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society
📝https://github.com/lightaime/camel
📝https://github.com/lightaime/camel
GitHub
GitHub - camel-ai/camel: 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel…
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org - camel-ai/camel
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
📝https://github.com/microsoft/agieval
📝https://github.com/microsoft/agieval
GitHub
GitHub - microsoft/AGIEval
Contribute to microsoft/AGIEval development by creating an account on GitHub.
A Method for Animating Children's Drawings of the Human Figure
📝https://github.com/facebookresearch/AnimatedDrawings
📝https://github.com/facebookresearch/AnimatedDrawings
GitHub
GitHub - facebookresearch/AnimatedDrawings: Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
Code to accompany "A Method for Animating Children's Drawings of the Human Figure" - facebookresearch/AnimatedDrawings
OpenAssistant Conversations -- Democratizing Large Language Model Alignment
📝https://github.com/laion-ai/open-assistant
📝https://github.com/laion-ai/open-assistant
GitHub
GitHub - LAION-AI/Open-Assistant: OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party…
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so. - LAION-AI/Open-Assistant
HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge
📝https://github.com/scir-hi/huatuo-llama-med-chinese
📝https://github.com/scir-hi/huatuo-llama-med-chinese
GitHub
GitHub - SCIR-HI/Huatuo-Llama-Med-Chinese: Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models…
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调 - GitHub - SCIR-HI/Huatuo-Llama-Med-Chinese:...
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text
📝https://github.com/allenai/mmc4
📝https://github.com/allenai/mmc4
GitHub
GitHub - allenai/mmc4: MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text. - allenai/mmc4
Inpaint Anything: Segment Anything Meets Image Inpainting
📝https://github.com/geekyutao/inpaint-anything
📝https://github.com/geekyutao/inpaint-anything
GitHub
GitHub - geekyutao/Inpaint-Anything: Inpaint anything using Segment Anything and inpainting models.
Inpaint anything using Segment Anything and inpainting models. - geekyutao/Inpaint-Anything
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
📝https://github.com/lupantech/chameleon-llm
📝https://github.com/lupantech/chameleon-llm
GitHub
GitHub - lupantech/chameleon-llm: Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models". - lupantech/chameleon-llm
DINOv2: Learning Robust Visual Features without Supervision
📝https://github.com/facebookresearch/dinov2
📝https://github.com/facebookresearch/dinov2
GitHub
GitHub - facebookresearch/dinov2: PyTorch code and models for the DINOv2 self-supervised learning method.
PyTorch code and models for the DINOv2 self-supervised learning method. - facebookresearch/dinov2
Transformer-Based Visual Segmentation: A Survey
📝https://github.com/lxtgh/awesome-segmenation-with-transformer
📝https://github.com/lxtgh/awesome-segmenation-with-transformer
GitHub
GitHub - lxtGH/Awesome-Segmenation-With-Transformer: Transformer-Based Visual Segmentation: A Survey
Transformer-Based Visual Segmentation: A Survey. Contribute to lxtGH/Awesome-Segmenation-With-Transformer development by creating an account on GitHub.