SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
📝https://github.com/facebookresearch/seamless_communication
📝https://github.com/facebookresearch/seamless_communication
GitHub
GitHub - facebookresearch/seamless_communication: Foundational Models for State-of-the-Art Speech and Text Translation
Foundational Models for State-of-the-Art Speech and Text Translation - facebookresearch/seamless_communication
NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction
📝https://github.com/PJLab-ADG/neuralsim/blob/main/docs/methods/neus_in_minutes.md
📝https://github.com/PJLab-ADG/neuralsim/blob/main/docs/methods/neus_in_minutes.md
GitHub
neuralsim/docs/methods/neus_in_minutes.md at main · PJLab-ADG/neuralsim
neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering. - PJLab-ADG/neuralsim
StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views
📝https://github.com/pjlab-ADG/neuralsim
📝https://github.com/pjlab-ADG/neuralsim
GitHub
GitHub - PJLab-ADG/neuralsim: neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering.
neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering. - GitHub - PJLab-ADG/neuralsim: neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering.
IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
📝https://github.com/buaacyw/it3d-text-to-3d
📝https://github.com/buaacyw/it3d-text-to-3d
GitHub
GitHub - buaacyw/IT3D-text-to-3D: [AAAI'2024] IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
[AAAI'2024] IT3D: Improved Text-to-3D Generation with Explicit View Synthesis - GitHub - buaacyw/IT3D-text-to-3D: [AAAI'2024] IT3D: Improved Text-to-3D Generation with Explicit View Synthesis
Prompt2Model: Generating Deployable Models from Natural Language Instructions
📝https://github.com/neulab/prompt2model
📝https://github.com/neulab/prompt2model
GitHub
GitHub - neulab/prompt2model: prompt2model - Generate Deployable Models from Natural Language Instructions
prompt2model - Generate Deployable Models from Natural Language Instructions - neulab/prompt2model
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities
📝https://github.com/qwenlm/qwen-vl
📝https://github.com/qwenlm/qwen-vl
GitHub
GitHub - QwenLM/Qwen-VL: The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba…
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. - QwenLM/Qwen-VL
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models
📝https://github.com/opengvlab/omniquant
📝https://github.com/opengvlab/omniquant
GitHub
GitHub - OpenGVLab/OmniQuant: [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs. - GitHub - OpenGVLab/OmniQuant: [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization techni...
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition
📝https://github.com/Shilin-LU/TF-ICON
📝https://github.com/Shilin-LU/TF-ICON
GitHub
GitHub - Shilin-LU/TF-ICON: [ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation)
[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official Implementation) - Shilin-LU/TF-ICON
Nougat: Neural Optical Understanding for Academic Documents
📝https://github.com/facebookresearch/nougat
📝https://github.com/facebookresearch/nougat
GitHub
GitHub - facebookresearch/nougat: Implementation of Nougat Neural Optical Understanding for Academic Documents
Implementation of Nougat Neural Optical Understanding for Academic Documents - facebookresearch/nougat
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
📝https://github.com/nlpxucan/wizardlm
📝https://github.com/nlpxucan/wizardlm
GitHub
GitHub - nlpxucan/WizardLM: LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath - nlpxucan/WizardLM
A Survey on Large Language Model based Autonomous Agents
📝https://github.com/paitesanshi/llm-agent-survey
📝https://github.com/paitesanshi/llm-agent-survey
GitHub
GitHub - Paitesanshi/LLM-Agent-Survey
Contribute to Paitesanshi/LLM-Agent-Survey development by creating an account on GitHub.
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
📝https://github.com/haoheliu/AudioLDM2
📝https://github.com/haoheliu/AudioLDM2
GitHub
GitHub - haoheliu/AudioLDM2: Text-to-Audio/Music Generation
Text-to-Audio/Music Generation. Contribute to haoheliu/AudioLDM2 development by creating an account on GitHub.
PointLLM: Empowering Large Language Models to Understand Point Clouds
📝https://github.com/openrobotlab/pointllm
📝https://github.com/openrobotlab/pointllm
GitHub
GitHub - OpenRobotLab/PointLLM: [arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds
[arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds - GitHub - OpenRobotLab/PointLLM: [arXiv 2023] PointLLM: Empowering Large Language Models to Understand Point Clouds
DISC-MedLLM: Bridging General Large Language Models and Real-World Medical Consultation
📝https://github.com/fudandisc/disc-medllm
📝https://github.com/fudandisc/disc-medllm
GitHub
GitHub - FudanDISC/DISC-MedLLM: Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models…
Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical response in end-to-end conversational healthcare servi...
FaceChain: A Playground for Identity-Preserving Portrait Generation
📝https://github.com/modelscope/facechain
📝https://github.com/modelscope/facechain
GitHub
GitHub - modelscope/facechain: FaceChain is a deep-learning toolchain for generating your Digital-Twin.
FaceChain is a deep-learning toolchain for generating your Digital-Twin. - modelscope/facechain
High-Resolution Document Shadow Removal via A Large-Scale Real-World Dataset and A Frequency-Aware Shadow Erasing Net
📝https://github.com/CXH-Research/DocShadow-SD7K
📝https://github.com/CXH-Research/DocShadow-SD7K
GitHub
GitHub - CXH-Research/DocShadow-SD7K: [ICCV 2023] A large-scale high-resolution dataset satisfies all important data features about…
[ICCV 2023] A large-scale high-resolution dataset satisfies all important data features about document shadow, covers a large number of document shadow images. - CXH-Research/DocShadow-SD7K