A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification
📝Conformal prediction is a user-friendly paradigm for creating statistically rigorous uncertainty sets/intervals for the predictions of such models.
https://github.com/aangelopoulos/conformal-prediction
📝Conformal prediction is a user-friendly paradigm for creating statistically rigorous uncertainty sets/intervals for the predictions of such models.
https://github.com/aangelopoulos/conformal-prediction
GitHub
GitHub - aangelopoulos/conformal-prediction: Lightweight, useful implementation of conformal prediction on real data.
Lightweight, useful implementation of conformal prediction on real data. - aangelopoulos/conformal-prediction
👍1
Transformers are Sample Efficient World Models
📝Deep reinforcement learning agents are notoriously sample inefficient, which considerably limits their application to real-world problems.
https://github.com/eloialonso/iris
📝Deep reinforcement learning agents are notoriously sample inefficient, which considerably limits their application to real-world problems.
https://github.com/eloialonso/iris
GitHub
GitHub - eloialonso/iris: Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. - eloialonso/iris
👍1
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
📝We propose Fast text2StyleGAN, a natural language interface that adapts pre-trained GANs for text-guided human face synthesis.
https://github.com/duxiaodan/fast_text2stylegan
📝We propose Fast text2StyleGAN, a natural language interface that adapts pre-trained GANs for text-guided human face synthesis.
https://github.com/duxiaodan/fast_text2stylegan
GitHub
GitHub - duxiaodan/Fast_text2StyleGAN: Official repo of Text-Free Learning of a Natural Language Interface for Pretrained Face…
Official repo of Text-Free Learning of a Natural Language Interface for Pretrained Face Generators - duxiaodan/Fast_text2StyleGAN
👍1
Behavior Trees in Robotics and AI: An Introduction
📝A Behavior Tree (BT) is a way to structure the switching between different tasks in an autonomous agent, such as a robot or a virtual entity in a computer game.
https://github.com/BehaviorTree/BehaviorTree.CPP
📝A Behavior Tree (BT) is a way to structure the switching between different tasks in an autonomous agent, such as a robot or a virtual entity in a computer game.
https://github.com/BehaviorTree/BehaviorTree.CPP
GitHub
GitHub - BehaviorTree/BehaviorTree.CPP: Behavior Trees Library in C++. Batteries included.
Behavior Trees Library in C++. Batteries included. - BehaviorTree/BehaviorTree.CPP
👍2
FedBN: Federated Learning on Non-IID Features via Local Batch Normalization
📝The emerging paradigm of federated learning (FL) strives to enable collaborative training of deep models on the network edge without centrally aggregating raw data and hence improving data privacy.
https://github.com/adap/flower
📝The emerging paradigm of federated learning (FL) strives to enable collaborative training of deep models on the network edge without centrally aggregating raw data and hence improving data privacy.
https://github.com/adap/flower
GitHub
GitHub - adap/flower: Flower: A Friendly Federated AI Framework
Flower: A Friendly Federated AI Framework. Contribute to adap/flower development by creating an account on GitHub.
👍1
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
📝Through the preliminary study on diffusion model parameterization, we find that previous gradient-based TTS models require hundreds or thousands of iterations to guarantee high sample quality, which poses a challenge for accelerating sampling.
https://github.com/Rongjiehuang/ProDiff
📝Through the preliminary study on diffusion model parameterization, we find that previous gradient-based TTS models require hundreds or thousands of iterations to guarantee high sample quality, which poses a challenge for accelerating sampling.
https://github.com/Rongjiehuang/ProDiff
GitHub
GitHub - Rongjiehuang/ProDiff: PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline
PyTorch Implementation of ProDiff (ACM-MM'22) with a Extremely-Fast diffusion speech synthesis pipeline - Rongjiehuang/ProDiff
👍1
Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
📝To achieve super-resolution inverse tone mapping, we derive a continuous representation of 360-degree imaging from the LDR panorama as a set of structured latent codes anchored to the sphere.
https://github.com/frozenburning/text2light
📝To achieve super-resolution inverse tone mapping, we derive a continuous representation of 360-degree imaging from the LDR panorama as a set of structured latent codes anchored to the sphere.
https://github.com/frozenburning/text2light
GitHub
GitHub - FrozenBurning/Text2Light: [SIGGRAPH Asia 2022] Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
[SIGGRAPH Asia 2022] Text2Light: Zero-Shot Text-Driven HDR Panorama Generation - GitHub - FrozenBurning/Text2Light: [SIGGRAPH Asia 2022] Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
👍2
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation
📝Notably, SegNeXt outperforms EfficientNet-L2 w/ NAS-FPN and achieves 90. 6% mIoU on the Pascal VOC 2012 test leaderboard using only 1/10 parameters of it.
https://github.com/visual-attention-network/segnext
📝Notably, SegNeXt outperforms EfficientNet-L2 w/ NAS-FPN and achieves 90. 6% mIoU on the Pascal VOC 2012 test leaderboard using only 1/10 parameters of it.
https://github.com/visual-attention-network/segnext
GitHub
GitHub - Visual-Attention-Network/SegNeXt: Official Pytorch implementations for "SegNeXt: Rethinking Convolutional Attention Design…
Official Pytorch implementations for "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022) - Visual-Attention-Network/SegNeXt
Robust Speech Recognition via Large-Scale Weak Supervision
📝We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet.
https://github.com/openai/whisper
📝We study the capabilities of speech processing systems trained simply to predict large amounts of transcripts of audio on the internet.
https://github.com/openai/whisper
GitHub
GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
Robust Speech Recognition via Large-Scale Weak Supervision - openai/whisper
Diffusion Models: A Comprehensive Survey of Methods and Applications
📝Diffusion models are a class of deep generative models that have shown impressive results on various tasks with a solid theoretical foundation.
https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
📝Diffusion models are a class of deep generative models that have shown impressive results on various tasks with a solid theoretical foundation.
https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
GitHub
GitHub - YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy: Diffusion model papers, survey, and taxonomy
Diffusion model papers, survey, and taxonomy. Contribute to YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy development by creating an account on GitHub.
Plenoxels: Radiance Fields without Neural Networks
📝We introduce Plenoxels (plenoptic voxels), a system for photorealistic view synthesis.
https://github.com/kakaobrain/NeRF-Factory
📝We introduce Plenoxels (plenoptic voxels), a system for photorealistic view synthesis.
https://github.com/kakaobrain/NeRF-Factory
GitHub
GitHub - kakaobrain/nerf-factory: An awesome PyTorch NeRF library
An awesome PyTorch NeRF library. Contribute to kakaobrain/nerf-factory development by creating an account on GitHub.
LAVIS: A Library for Language-Vision Intelligence
📝We introduce LAVIS, an open-source deep learning library for LAnguage-VISion research and applications.
https://github.com/salesforce/lavis
📝We introduce LAVIS, an open-source deep learning library for LAnguage-VISion research and applications.
https://github.com/salesforce/lavis
GitHub
GitHub - salesforce/LAVIS: LAVIS - A One-stop Library for Language-Vision Intelligence
LAVIS - A One-stop Library for Language-Vision Intelligence - salesforce/LAVIS
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
📝Compared to other models on the leaderboard, DINO significantly reduces its model size and pre-training data size while achieving better results.
https://github.com/IDEA-Research/detrex
📝Compared to other models on the leaderboard, DINO significantly reduces its model size and pre-training data size while achieving better results.
https://github.com/IDEA-Research/detrex
GitHub
GitHub - IDEA-Research/detrex: detrex is a research platform for DETR-based object detection, segmentation, pose estimation and…
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks. - IDEA-Research/detrex
Deep Learning for Medical Image Segmentation: Tricks, Challenges and Future Directions
📝Over the past few years, the rapid development of deep learning technologies for computer vision has greatly promoted the performance of medical image segmentation (MedISeg).
https://github.com/hust-linyi/seg_trick
📝Over the past few years, the rapid development of deep learning technologies for computer vision has greatly promoted the performance of medical image segmentation (MedISeg).
https://github.com/hust-linyi/seg_trick
GitHub
GitHub - hust-linyi/MedISeg
Contribute to hust-linyi/MedISeg development by creating an account on GitHub.
High-Resolution Image Synthesis with Latent Diffusion Models
📝By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond.
https://github.com/compvis/stable-diffusion
📝By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond.
https://github.com/compvis/stable-diffusion
GitHub
GitHub - CompVis/stable-diffusion: A latent text-to-image diffusion model
A latent text-to-image diffusion model. Contribute to CompVis/stable-diffusion development by creating an account on GitHub.
USB: A Unified Semi-supervised Learning Benchmark
📝Semi-supervised learning (SSL) improves model generalization by leveraging massive unlabeled data to augment limited labeled samples.
https://github.com/microsoft/semi-supervised-learning
📝Semi-supervised learning (SSL) improves model generalization by leveraging massive unlabeled data to augment limited labeled samples.
https://github.com/microsoft/semi-supervised-learning
GitHub
GitHub - microsoft/Semi-supervised-learning: A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
A Unified Semi-Supervised Learning Codebase (NeurIPS'22) - GitHub - microsoft/Semi-supervised-learning: A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Advancing Model Pruning via Bi-level Optimization
📝To reduce the computation overhead, various efficient 'one-shot' pruning methods have been developed, but these schemes are usually unable to find winning tickets as good as IMP.
https://github.com/optml-group/bip
📝To reduce the computation overhead, various efficient 'one-shot' pruning methods have been developed, but these schemes are usually unable to find winning tickets as good as IMP.
https://github.com/optml-group/bip
GitHub
GitHub - OPTML-Group/BiP: [NeurIPS22] "Advancing Model Pruning via Bi-level Optimization" by Yihua Zhang*, Yuguang Yao*, Parikshit…
[NeurIPS22] "Advancing Model Pruning via Bi-level Optimization" by Yihua Zhang*, Yuguang Yao*, Parikshit Ram, Pu Zhao, Tianlong Chen, Mingyi Hong, Yanzhi Wang, and Sijia Liu - Git...
An Efficient Person Clustering Algorithm for Open Checkout-free Groceries
📝Then, to ensure that the method adapts to the dynamic and unseen person flow, we propose Graph Convolutional Network (GCN) with a simple Nearest Neighbor (NN) strategy to accurately cluster the instances of CSG.
https://github.com/WuJunde/checkoutfree
📝Then, to ensure that the method adapts to the dynamic and unseen person flow, we propose Graph Convolutional Network (GCN) with a simple Nearest Neighbor (NN) strategy to accurately cluster the instances of CSG.
https://github.com/WuJunde/checkoutfree
GitHub
GitHub - WuJunde/checkoutfree: It is a python implementation of the person clustering algorithm in the check-out free grocery visual…
It is a python implementation of the person clustering algorithm in the check-out free grocery visual system. - GitHub - WuJunde/checkoutfree: It is a python implementation of the person clustering...
NerfAcc: A General NeRF Acceleration Toolbox
📝We propose NerfAcc, a toolbox for efficient volumetric rendering of radiance fields.
https://github.com/kair-bair/nerfacc
📝We propose NerfAcc, a toolbox for efficient volumetric rendering of radiance fields.
https://github.com/kair-bair/nerfacc
GitHub
GitHub - nerfstudio-project/nerfacc: A General NeRF Acceleration Toolbox in PyTorch.
A General NeRF Acceleration Toolbox in PyTorch. Contribute to nerfstudio-project/nerfacc development by creating an account on GitHub.
Human Motion Diffusion Model
📝In this paper, we introduce Motion Diffusion Model (MDM), a carefully adapted classifier-free diffusion-based generative model for the human motion domain.
https://github.com/guytevet/motion-diffusion-model
📝In this paper, we introduce Motion Diffusion Model (MDM), a carefully adapted classifier-free diffusion-based generative model for the human motion domain.
https://github.com/guytevet/motion-diffusion-model
GitHub
GitHub - GuyTevet/motion-diffusion-model: The official PyTorch implementation of the paper "Human Motion Diffusion Model"
The official PyTorch implementation of the paper "Human Motion Diffusion Model" - GuyTevet/motion-diffusion-model
VToonify: Controllable High-Resolution Portrait Video Style Transfer
📝Although a series of successful portrait image toonification models built upon the powerful StyleGAN have been proposed, these image-oriented methods have obvious limitations when applied to videos, such as the fixed frame size, the requirement of face alignment, missing non-facial details and temporal inconsistency.
https://github.com/williamyang1991/vtoonify
📝Although a series of successful portrait image toonification models built upon the powerful StyleGAN have been proposed, these image-oriented methods have obvious limitations when applied to videos, such as the fixed frame size, the requirement of face alignment, missing non-facial details and temporal inconsistency.
https://github.com/williamyang1991/vtoonify
GitHub
GitHub - williamyang1991/VToonify: [SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer - williamyang1991/VToonify