Machine Learning Projects

RePaint: Inpainting using Denoising Diffusion Probabilistic Models

📝In this work, we propose RePaint: A Denoising Diffusion Probabilistic Model (DDPM) based inpainting approach that is applicable to even extreme masks.
https://github.com/andreas128/RePaint

Repository: andreas128/RePaint. Official PyTorch code and models, CVPR 2022.

Reconstructing 3D Human Pose by Watching Humans in the Mirror

In this paper, we introduce the new task of reconstructing 3D human pose from a single image in which we can see the person and the person's image through a mirror.
https://github.com/zju3dv/EasyMocap

Pretraining is All You Need for Image-to-Image Translation

We propose to use pretraining to boost general image-to-image translation.

https://github.com/PITI-Synthesis/PITI

Elucidating the Design Space of Diffusion-Based Generative Models

We argue that the theory and practice of diffusion-based generative models are currently unnecessarily convoluted and seek to remedy the situation by presenting a design space that clearly separates the concrete design choices.

https://github.com/lucidrains/imagen-pytorch

Ivy: Templated Deep Learning for Inter-Framework Portability

We introduce Ivy, a templated Deep Learning (DL) framework which abstracts existing DL frameworks.

https://github.com/ivy-dl/ivy

Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models

In RDMs, a set of nearest neighbors is retrieved from an external database during training for each training instance, and the diffusion model is conditioned on these informative samples.

https://github.com/compvis/latent-diffusion
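
The retrieval step described in this post can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the code from the linked repository: the toy 2-D embeddings, the use of cosine similarity, and the function names `cosine_similarity` and `retrieve_neighbors` are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve_neighbors(query, database, k):
    """Return the indices of the k database embeddings most similar to query."""
    ranked = sorted(
        range(len(database)),
        key=lambda i: cosine_similarity(query, database[i]),
        reverse=True,
    )
    return ranked[:k]


# Toy external database of embeddings (assumed values, for illustration only).
database = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [-1.0, 0.0]]
query = [1.0, 0.05]  # embedding of one training instance

neighbor_ids = retrieve_neighbors(query, database, k=2)
# The retrieved embeddings would be passed to the diffusion model as conditioning.
conditioning = [database[i] for i in neighbor_ids]
```

A real system would likely replace the linear scan with an approximate nearest-neighbor index, since the external database is large.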

Flow-Guided Transformer for Video Inpainting

In the spatial transformer in particular, we design a dual-perspective spatial MHSA that integrates global tokens into the window-based attention.
https://github.com/hitachinsk/fgt

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

📝We develop a procedure for Int8 matrix multiplication for feed-forward and attention projection layers in transformers, which cuts the memory needed for inference by half while retaining full precision performance.
https://github.com/timdettmers/bitsandbytes
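
The idea of an Int8 matrix multiplication can be illustrated with a small pure-Python sketch. It is a simplification under stated assumptions, not the bitsandbytes implementation: it uses absmax scaling with one scale per row of the input and one per column of the weights, it omits the paper's separate handling of outlier features, and the helper names (`quantize_rows`, `int8_matmul`, `float_matmul`) are hypothetical.

```python
def quantize_rows(matrix):
    """Absmax-quantize each row to integers in [-127, 127].

    Returns the quantized rows and one floating-point scale per row.
    """
    quantized, scales = [], []
    for row in matrix:
        absmax = max(abs(v) for v in row) or 1.0  # guard against all-zero rows
        scale = absmax / 127.0
        quantized.append([round(v / scale) for v in row])
        scales.append(scale)
    return quantized, scales


def transpose(m):
    return [list(col) for col in zip(*m)]


def int8_matmul(x, w):
    """Approximate x @ w using integer products and per-vector scales."""
    xq, x_scales = quantize_rows(x)               # one scale per row of x
    wq_t, w_scales = quantize_rows(transpose(w))  # one scale per column of w
    out = []
    for i, x_row in enumerate(xq):
        out_row = []
        for j, w_col in enumerate(wq_t):
            acc = sum(a * b for a, b in zip(x_row, w_col))  # integer accumulation
            out_row.append(acc * x_scales[i] * w_scales[j])  # dequantize
        out.append(out_row)
    return out


def float_matmul(x, w):
    """Reference floating-point product, used only for comparison."""
    return [[sum(a * b for a, b in zip(row, col)) for col in transpose(w)] for row in x]


x = [[0.5, -1.2, 3.0], [2.0, 0.1, -0.7]]
w = [[1.0, -0.5], [0.25, 2.0], [-1.5, 0.75]]
approx = int8_matmul(x, w)
exact = float_matmul(x, w)
```

Storing the quantized values in one byte instead of two is where the halved memory footprint comes from; the scales add only one float per row or column.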

Repository: TimDettmers/bitsandbytes. 8-bit CUDA functions for PyTorch.

KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints

📝In this work, we investigate common issues with existing spatial encodings and propose a simple yet highly effective approach to modeling high-fidelity volumetric humans from sparse views.
https://github.com/facebookresearch/KeypointNeRF

Collaborative Neural Rendering using Anime Character Sheets

📝Drawing images of characters with desired poses is an essential but laborious task in anime production.
https://github.com/megvii-research/conr

Deep Patch Visual Odometry

📝We propose Deep Patch Visual Odometry (DPVO), a new deep learning system for monocular Visual Odometry (VO).
https://github.com/princeton-vl/dpvo

StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3

📝Notably, StyleFaceV is capable of generating realistic 1024×1024 face videos even without high-resolution training videos.
https://github.com/arthur-qiu/stylefacev

Multi-instrument Music Synthesis with Spectrogram Diffusion

📝An ideal music synthesizer should be both interactive and expressive, generating high-fidelity audio in realtime for arbitrary combinations of instruments and notes.

https://github.com/magenta/music-spectrogram-diffusion

MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

📝We adopt a hierarchical query embedding scheme to flexibly encode structured map information and perform hierarchical bipartite matching for map element learning.

https://github.com/hustvl/maptr
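
A two-level matching of predicted map elements to ground truth can be sketched as follows. This is a loose illustration, not MapTR's code: it assumes equal numbers of predictions and targets, uses a plain L1 point distance as the only cost, treats a polyline and its reversed ordering as equivalent, and searches assignments by brute force instead of the Hungarian algorithm. All function names are hypothetical.

```python
from itertools import permutations


def polyline_distance(pred, target):
    """Sum of L1 distances between corresponding points of two polylines."""
    return sum(abs(px - tx) + abs(py - ty) for (px, py), (tx, ty) in zip(pred, target))


def point_level_match(pred, target):
    """Pick the cheaper of the two equivalent orderings of the target polyline."""
    candidates = [target, target[::-1]]
    costs = [polyline_distance(pred, c) for c in candidates]
    best = min(range(len(candidates)), key=lambda i: costs[i])
    return candidates[best], costs[best]


def instance_level_match(preds, targets):
    """Brute-force one-to-one assignment that minimizes the total matching cost.

    Returns (assignment, cost), where assignment[i] is the index of the
    target matched to prediction i.
    """
    best_perm, best_cost = None, float("inf")
    for perm in permutations(range(len(targets))):
        cost = sum(point_level_match(preds[i], targets[j])[1] for i, j in enumerate(perm))
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    return list(best_perm), best_cost


# Toy example: two predicted polylines, two ground-truth polylines.
preds = [[(0, 0), (1, 0)], [(5, 5), (5, 6)]]
targets = [[(5, 6), (5, 5)], [(0, 0), (1, 0)]]  # first target is stored in reverse order

assignment, total_cost = instance_level_match(preds, targets)
```

Brute force is only practical for a handful of elements; it is used here to keep the sketch dependency-free.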

Repository: hustvl/MapTR. ICLR'23 Spotlight.

YOLOPv2: Better, Faster, Stronger for Panoptic Driving Perception

📝Over the last decade, multi-task learning approaches have achieved promising results in solving panoptic driving perception problems, providing both high-precision and high-efficiency performance.

https://github.com/CAIC-AD/YOLOPv2

Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition

📝In particular, we develop a novel cross-channel feature augmentation module, which is a combo of map-attend-group-map operations.

https://github.com/hikvision-research/skelact

Repository: hikvision-research/skelact. Skeleton-based action recognition models in PyTorch, including Two-Stream CNN, HCN, HCN-Baseline, Ta-CNN and Dynamic GCN.

FILM: Frame Interpolation for Large Motion

📝Recent methods use multiple networks to estimate optical flow or depth and a separate network dedicated to frame synthesis.

https://github.com/google-research/frame-interpolation

Repository: google-research/frame-interpolation. FILM: Frame Interpolation for Large Motion, ECCV 2022.

Online Decision Transformer

📝Recent work has shown that offline reinforcement learning (RL) can be formulated as a sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via approaches similar to large-scale language modeling.

https://github.com/facebookresearch/online-dt
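
Treating a trajectory as a sequence, as this post describes, can be sketched by computing returns-to-go and interleaving them with states and actions, in the style of the Decision Transformer line of work the sentence cites. This is a minimal illustration, not the code from the linked repository: the token tuples and the function names `returns_to_go` and `to_token_sequence` are hypothetical.

```python
def returns_to_go(rewards):
    """For each timestep, the sum of rewards from that step to the end."""
    rtg, running = [], 0.0
    for r in reversed(rewards):
        running += r
        rtg.append(running)
    return rtg[::-1]


def to_token_sequence(states, actions, rewards):
    """Interleave (return-to-go, state, action) per timestep into one flat sequence."""
    sequence = []
    for g, s, a in zip(returns_to_go(rewards), states, actions):
        sequence.extend([("return_to_go", g), ("state", s), ("action", a)])
    return sequence


# Toy three-step trajectory (assumed values, for illustration only).
states = ["s0", "s1", "s2"]
actions = ["a0", "a1", "a2"]
rewards = [1.0, 0.0, 2.0]

rtg = returns_to_go(rewards)
tokens = to_token_sequence(states, actions, rewards)
```

A sequence model trained on such tokens predicts the next action given the preceding returns-to-go, states, and actions.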

YOLOX-PAI: An Improved YOLOX Version by PAI

📝We develop an all-in-one computer vision toolbox named EasyCV to facilitate the use of various SOTA computer vision methods.

https://github.com/alibaba/EasyCV

PeRFception: Perception using Radiance Fields

📝The recent progress in implicit 3D representation, i.e., Neural Radiance Fields (NeRFs), has made accurate and photorealistic 3D reconstruction possible in a differentiable manner.

https://github.com/POSTECH-CVLab/PeRFception
