SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
📝https://github.com/stability-ai/generative-models
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
📝https://github.com/guoyww/animatediff
Semantic-SAM: Segment and Recognize Anything at Any Granularity
📝https://github.com/ux-decoder/semantic-sam
Petals: Collaborative Inference and Fine-tuning of Large Models
📝https://github.com/bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading.
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI
📝https://github.com/salesforce/DialogStudio
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
📝https://github.com/hazyresearch/flash-attention
Dao-AILab/flash-attention: Fast and memory-efficient exact attention.
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
🖥 https://github.com/yvanyin/metric3d
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
🖥 https://github.com/guochengqian/magic123
WavJourney: Compositional Audio Creation with Large Language Models
🖥 https://github.com/audio-agi/wavjourney
Foundational Models Defining a New Era in Vision: A Survey and Outlook
🖥 https://github.com/awaisrauf/awesome-cv-foundational-models
Universal and Transferable Adversarial Attacks on Aligned Language Models
📝https://github.com/llm-attacks/llm-attacks
Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields
📝https://github.com/windingwind/seal-3d
The official implementation of the paper, described as the first interactive pixel-level NeRF editing tool.
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
📝https://github.com/openbmb/toolbench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
Effective Whole-body Pose Estimation with Two-stages Distillation
📝https://github.com/idea-research/dwpose
ICCV 2023, CV4Metaverse Workshop.