TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition
📝https://github.com/Shilin-LU/TF-ICON
📝https://github.com/Shilin-LU/TF-ICON
GitHub
GitHub - Shilin-LU/TF-ICON: [ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)
[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation) - Shilin-LU/TF-ICON
Nougat: Neural Optical Understanding for Academic Documents
📝https://github.com/facebookresearch/nougat
📝https://github.com/facebookresearch/nougat
GitHub
GitHub - facebookresearch/nougat: Implementation of Nougat Neural Optical Understanding for Academic Documents
Implementation of Nougat Neural Optical Understanding for Academic Documents - facebookresearch/nougat
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
📝https://github.com/nlpxucan/wizardlm
📝https://github.com/nlpxucan/wizardlm
GitHub
GitHub - nlpxucan/WizardLM: LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath - nlpxucan/WizardLM
A Survey on Large Language Model based Autonomous Agents
📝https://github.com/paitesanshi/llm-agent-survey
📝https://github.com/paitesanshi/llm-agent-survey
GitHub
GitHub - Paitesanshi/LLM-Agent-Survey
Contribute to Paitesanshi/LLM-Agent-Survey development by creating an account on GitHub.
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
📝https://github.com/haoheliu/AudioLDM2
📝https://github.com/haoheliu/AudioLDM2
GitHub
GitHub - haoheliu/AudioLDM2: Text-to-Audio/Music Generation
Text-to-Audio/Music Generation. Contribute to haoheliu/AudioLDM2 development by creating an account on GitHub.
PointLLM: Empowering Large Language Models to Understand Point Clouds
📝https://github.com/openrobotlab/pointllm
📝https://github.com/openrobotlab/pointllm
GitHub
GitHub - OpenRobotLab/PointLLM: [arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds
[arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds - GitHub - OpenRobotLab/PointLLM: [arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds
DISC-MedLLM: Bridging General Large Language Models and Real-World Medical Consultation
📝https://github.com/fudandisc/disc-medllm
📝https://github.com/fudandisc/disc-medllm
GitHub
GitHub - FudanDISC/DISC-MedLLM: Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models…
Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical response in end-to-end conversational healthcare servi...
FaceChain: A Playground for Identity-Preserving Portrait Generation
📝https://github.com/modelscope/facechain
📝https://github.com/modelscope/facechain
GitHub
GitHub - modelscope/facechain: FaceChain is a deep-learning toolchain for generating your Digital-Twin.
FaceChain is a deep-learning toolchain for generating your Digital-Twin. - modelscope/facechain
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
📝https://github.com/CXH-Research/DocShadow-SD7K
📝https://github.com/CXH-Research/DocShadow-SD7K
GitHub
GitHub - CXH-Research/DocShadow-SD7K: [ICCV 2023] A large-scale high-resolution dataset satisfies all important data features about…
[ICCV 2023] A large-scale high-resolution dataset satisfies all important data features about document shadow, covers a large number of document shadow images. - CXH-Research/DocShadow-SD7K
YaRN: Efficient Context Window Extension of Large Language Models
📝https://github.com/jquesnelle/yarn
📝https://github.com/jquesnelle/yarn
GitHub
GitHub - jquesnelle/yarn: YaRN: Efficient Context Window Extension of Large Language Models
YaRN: Efficient Context Window Extension of Large Language Models - jquesnelle/yarn
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
📝https://github.com/ziyuguo99/point-bind_point-llm
📝https://github.com/ziyuguo99/point-bind_point-llm
GitHub
GitHub - ZiyuGuo99/Point-Bind_Point-LLM: Align 3D Point Cloud with Multi-modalities for Large Language Models
Align 3D Point Cloud with Multi-modalities for Large Language Models - GitHub - ZiyuGuo99/Point-Bind_Point-LLM: Align 3D Point Cloud with Multi-modalities for Large Language Models
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
📝https://github.com/THUDM/RelayDiffusion
📝https://github.com/THUDM/RelayDiffusion
GitHub
GitHub - THUDM/RelayDiffusion: The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for…
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" - GitHub - THUDM/RelayDiffusion: The official implementation of...
PyGraft: Configurable Generation of Schemas and Knowledge Graphs at Your Fingertips
📝https://github.com/nicolas-hbt/pygraft
📝https://github.com/nicolas-hbt/pygraft
GitHub
GitHub - nicolas-hbt/pygraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips - nicolas-hbt/pygraft
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents
📝https://github.com/openbmb/agentverse
📝https://github.com/openbmb/agentverse
GitHub
GitHub - OpenBMB/AgentVerse: 🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications…
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation - GitHub - OpenBMB...
Tracking Anything with Decoupled Video Segmentation
📝https://github.com/hkchengrex/Tracking-Anything-with-DEVA
📝https://github.com/hkchengrex/Tracking-Anything-with-DEVA
GitHub
GitHub - hkchengrex/Tracking-Anything-with-DEVA: [ICCV 2023] Tracking Anything with Decoupled Video Segmentation
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation - hkchengrex/Tracking-Anything-with-DEVA
Break-A-Scene: Extracting Multiple Concepts from a Single Image
📝https://github.com/google/break-a-scene
📝https://github.com/google/break-a-scene
GitHub
GitHub - google/break-a-scene: Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH…
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023] - google/break-a-scene
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
📝https://github.com/gnobitab/instaflow
📝https://github.com/gnobitab/instaflow
GitHub
GitHub - gnobitab/InstaFlow: :zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024) - gnobitab/InstaFlow