🔥 T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models.
Github: https://github.com/google-research/t5x
Paper: https://arxiv.org/abs/2203.17189v1
@ArtificialIntelligencedl
💻 TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing (CVPR 2022)
Recent advances like StyleGAN have promoted the growth of controllable facial editing.
Github: https://github.com/billyxyb/transeditor
Paper: https://arxiv.org/abs/2203.17266v1
Exploiting Explainable Metrics for Augmented SGD
New explainability metrics that measure the redundant information in a network's layers and exploit it to augment Stochastic Gradient Descent (SGD)
Code: https://github.com/mahdihosseini/rmsgd
Paper: https://arxiv.org/pdf/2203.16723v1.pdf
Dataset: https://paperswithcode.com/dataset/mhist
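The general idea can be sketched in a few lines of numpy. This is an illustration only, not the paper's RMSGD: the redundancy metric here (fraction of near-zero singular values in a weight matrix) and the learning-rate scaling rule are crude stand-ins for the paper's metrics.

```python
import numpy as np

def redundancy_metric(W, tol=0.01):
    """Fraction of near-zero singular values of a weight matrix:
    a crude proxy for redundant (low-rank) information in a layer."""
    s = np.linalg.svd(W, compute_uv=False)
    return float(np.mean(s / s.max() < tol))

def layer_lr(base_lr, W, boost=2.0):
    """Scale the learning rate up for layers judged redundant,
    nudging them to explore new directions (illustrative rule only)."""
    return base_lr * (1.0 + boost * redundancy_metric(W))

rng = np.random.default_rng(0)
full_rank = rng.standard_normal((64, 64))
low_rank = rng.standard_normal((64, 2)) @ rng.standard_normal((2, 64))

print(layer_lr(0.1, full_rank))  # stays close to the base rate
print(layer_lr(0.1, low_rank))   # boosted: most singular values vanish
```

A rank-2 layer gets a noticeably larger step size than a full-rank one, which is the flavour of "exploiting redundancy to augment SGD".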
Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution
Dynamic-GAN-VC (DYGAN-VC) uses a non-autoregressive structure and vector-quantised embeddings obtained from a VQWav2vec model
Code: https://github.com/mingjiechen/dyganvc
Paper: https://arxiv.org/abs/2203.17172v1
Dataset: https://github.com/nii-yamagishilab/VCC2020-database
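Vector quantisation, the core of the VQWav2vec features used above, is just a nearest-neighbour lookup into a learned codebook. A minimal numpy sketch (codebook size, dimensions, and frame counts are made up for illustration):

```python
import numpy as np

def vector_quantize(z, codebook):
    """Map each continuous frame embedding to its nearest codebook
    entry (L2 distance); returns indices and the quantised vectors."""
    d = np.linalg.norm(z[:, None, :] - codebook[None, :, :], axis=-1)
    idx = d.argmin(axis=1)
    return idx, codebook[idx]

rng = np.random.default_rng(0)
codebook = rng.standard_normal((320, 16))  # 320 learned codes, 16-dim
frames = rng.standard_normal((100, 16))    # 100 speech-frame embeddings
idx, zq = vector_quantize(frames, codebook)
print(idx.shape, zq.shape)  # (100,) (100, 16)
```

The discrete indices give the conversion model a compact, speaker-independent representation of content.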
Rethinking Portrait Matting with Privacy Preserving
Code: https://github.com/vitae-transformer/vitae-transformer-matting
Paper: https://arxiv.org/abs/2203.16828v1
Dataset: https://github.com/vitae-transformer/vitae-transformer-matting#ppt-setting-and-p3m-10k-dataset
MultiMAE: Multi-modal Multi-task Masked Autoencoders
An efficient and effective pre-training strategy for Vision Transformers
Project: https://multimae.epfl.ch/
Code: https://github.com/EPFL-VILAB/MultiMAE
Paper: https://arxiv.org/abs/2204.01678
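The pre-training strategy boils down to masking most patch tokens across modalities and encoding only the visible remainder. A minimal numpy sketch of that masking step (token counts, dimensions, and the 25% keep ratio are illustrative, not MultiMAE's exact configuration):

```python
import numpy as np

def random_mask(tokens, keep_ratio=0.25, rng=None):
    """Keep a random subset of patch tokens; a decoder would be
    trained to reconstruct the masked-out remainder."""
    rng = rng or np.random.default_rng()
    n = tokens.shape[0]
    keep = np.sort(rng.permutation(n)[: int(n * keep_ratio)])
    return tokens[keep], keep

rng = np.random.default_rng(0)
rgb = rng.standard_normal((196, 768))    # 14x14 RGB patch embeddings
depth = rng.standard_normal((196, 768))  # patch embeddings of a second modality
# Concatenate modalities and mask jointly, so the encoder sees only a
# small visible subset drawn from all modalities at once.
tokens = np.concatenate([rgb, depth])
visible, kept = random_mask(tokens, keep_ratio=0.25, rng=rng)
print(visible.shape)  # (98, 768)
```

Encoding only the visible quarter of the tokens is what makes MAE-style pre-training cheap relative to processing every patch.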
TESTR: Text Spotting Transformers
TExt Spotting TRansformers (TESTR), a generic end-to-end text spotting framework using Transformers for text detection and recognition in the wild
Code: https://github.com/mlpc-ucsd/testr
Paper: https://arxiv.org/abs/2204.01918
Dataset: https://ucsdcloud-my.sharepoint.com/:u:/g/personal/xiz102_ucsd_edu/EWgEM5BSRjBEua4B_qLrGR0BaombUL8K3d23ldXOb7wUNA?e=7VzH34
MIMDet
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Code: https://github.com/hustvl/mimdet
Paper: https://arxiv.org/abs/2204.02964v1
Dataset: https://paperswithcode.com/dataset/coco
Pretrained Model: https://dl.fbaipublicfiles.com/mae/pretrain/mae_pretrain_vit_base_full.pth
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
A new fine-grained dataset, FineDiving, built from diverse diving events with detailed annotations of action procedures
Code: https://github.com/xujinglin/finediving
Paper: https://arxiv.org/abs/2204.03646v1
Dataset: https://pan.baidu.com/s/1v85-np2FbS0J4UfAEiI4mg
Context-Sensitive Temporal Feature Learning for Gait Recognition
Code: https://github.com/oliverhxh/cstl
Paper: https://arxiv.org/abs/2204.03270v1
DaViT: Dual Attention Vision Transformer
Code: https://github.com/dingmyu/davit
Paper: https://arxiv.org/abs/2204.03645v1
Dataset: https://paperswithcode.com/dataset/ade20k
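DaViT's "dual attention" alternates attention over spatial tokens with attention over channels. The numpy sketch below shows only the core idea: it drops the Q/K/V projections, multi-head grouping, and window partitioning of the actual model, so shapes and scaling are illustrative.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def spatial_attention(x):
    """Self-attention across spatial tokens; x is (tokens, channels)."""
    return softmax(x @ x.T / np.sqrt(x.shape[1])) @ x

def channel_attention(x):
    """Self-attention across channels: transpose so each channel
    becomes a token attending over the others."""
    xt = x.T
    return (softmax(xt @ xt.T / np.sqrt(xt.shape[1])) @ xt).T

rng = np.random.default_rng(0)
x = rng.standard_normal((49, 64))  # 7x7 spatial tokens, 64 channels
y = channel_attention(spatial_attention(x))
print(y.shape)  # (49, 64)
```

Spatial attention mixes information across locations; channel attention mixes it across feature dimensions, giving each block a global receptive field along both axes.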
Vision Transformers for Single Image Dehazing
Code: https://github.com/IDKiro/DehazeFormer
Paper: https://arxiv.org/abs/2204.03883v1
Dataset: https://paperswithcode.com/dataset/rs-haze
@ArtificialIntelligencedl
Code: https://github.com/IDKiro/DehazeFormer
Paper: https://arxiv.org/abs/2204.03883v1
Dataset: https://paperswithcode.com/dataset/rs-haze
@ArtificialIntelligencedl
SuperGAT
A self-supervised graph attention network (SuperGAT), an improved graph attention model for noisy graphs
Code: https://github.com/dongkwan-kim/SuperGAT
Paper: https://arxiv.org/abs/2204.04879v1
Dataset: https://paperswithcode.com/dataset/ogb
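SuperGAT's self-supervision reuses the attention logits as an edge predictor, so that attention learns which links are reliable in a noisy graph. A simplified numpy sketch (single head, no neighbourhood masking; graph sizes and the loss details are illustrative, not the paper's exact formulation):

```python
import numpy as np

def leaky_relu(x, slope=0.2):
    return np.where(x > 0, x, slope * x)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def edge_scores(H, W, a):
    """GAT-style pairwise attention logits
    e_ij = LeakyReLU(a^T [W h_i ; W h_j])."""
    Z = H @ W
    d = Z.shape[1]
    return leaky_relu((Z @ a[:d])[:, None] + (Z @ a[d:])[None, :])

def link_prediction_loss(e, A):
    """Binary cross-entropy between edge probabilities sigmoid(e_ij)
    and the (possibly noisy) adjacency matrix A."""
    p = sigmoid(e)
    return float(-np.mean(A * np.log(p + 1e-9) + (1 - A) * np.log(1 - p + 1e-9)))

rng = np.random.default_rng(0)
H = rng.standard_normal((10, 8))    # 10 nodes, 8 input features
W = rng.standard_normal((8, 4))     # shared projection
a = rng.standard_normal(8)          # concatenated [a_src ; a_dst]
A = (rng.random((10, 10)) < 0.2).astype(float)
e = edge_scores(H, W, a)
print(e.shape)  # (10, 10)
```

Minimising the link-prediction loss alongside the main task pushes the attention weights toward edges that actually exist, which is the source of robustness on noisy graphs.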
@ArtificialIntelligencedl
A self-supervised graph attention network (SuperGAT), an improved graph attention model for noisy graph
Code: https://github.com/dongkwan-kim/SuperGAT
Paper: https://arxiv.org/abs/2204.04879v1
Dataset: https://paperswithcode.com/dataset/ogb
@ArtificialIntelligencedl
π8π1
FederatedScope-GNN: Towards a Unified, Comprehensive and Efficient Package for Federated Graph Learning
Code: https://github.com/alibaba/federatedscope
Paper: https://arxiv.org/abs/2204.05562v1
🔥 DALL·E 2
DALL·E 2 is a new AI system that can create realistic images and art from a description in natural language.
Openai: https://openai.com/dall-e-2/
Paper: https://cdn.openai.com/papers/dall-e-2.pdf
Video: https://vimeo.com/692375454
Understanding Engagement from Video Screengrabs
Code: https://github.com/wanghewei16/video-engagement-analysis
Paper: https://arxiv.org/abs/2204.06454v1
Data source: https://github.com/e-drishti/wacv2016
📦 YOLOV5-ti-lite Object Detection Models
Code: https://github.com/texasinstruments/edgeai-yolov5
Paper: https://arxiv.org/abs/2204.06806v1
Dataset: https://paperswithcode.com/dataset/coco
@ArtificialIntelligencedl
Code: https://github.com/texasinstruments/edgeai-yolov5
Paper: https://arxiv.org/abs/2204.06806v1
Dataset: https://paperswithcode.com/dataset/coco
@ArtificialIntelligencedl
β€6
Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval
Github: https://github.com/xiaoyuan1996/AMFMN
Paper: https://arxiv.org/abs/2204.09868v1
Dataset: https://paperswithcode.com/dataset/kitti
ResT V2: Simpler, Faster and Stronger
Code: https://github.com/wofmanaf/ResT
Paper: https://arxiv.org/abs/2204.07366v1
Dataset: https://drive.google.com/drive/folders/1H6QUZsKYbU6LECtxzGHKqEeGbx1E8uQ9
Temporally Efficient Vision Transformer for Video Instance Segmentation
Code: https://github.com/hustvl/tevit
Paper: https://arxiv.org/abs/2204.08412v1
Dataset: https://paperswithcode.com/dataset/youtubevis
@ArtificialIntelligencedl
Code: https://github.com/hustvl/tevit
Paper: https://arxiv.org/abs/2204.08412v1
Dataset: https://paperswithcode.com/dataset/youtubevis
@ArtificialIntelligencedl
π3π₯3π1