AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
PeRFception: Largest IR Dataset

👉#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

Highlights:
✅POSTECH + NVIDIA + Caltech = 🤯
✅Size: -96.4% from original dataset!
✅2D/3D image/object class/semantic
✅Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA
โค9โคโ€๐Ÿ”ฅ1๐Ÿ‘1๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
CHARL-E: Stable Diffusion in 1 click

👉CHARL-E packages Stable Diffusion into a simple app.

Highlights:
✅No setup, dependencies, or internet
✅Images with 1-click on #macbook
✅Suitable only for M1/M2 processor
✅Source code under MIT license

More: https://bit.ly/3xv2z3G
YOLOPv2: Better Driving Perception

👉YOLOPv2: simultaneous object, road segmentation & lane detection

Highlights:
✅E2E perception net with better backbone
✅Efficient ELAN for reasonable memory
✅Stability for adapting to scenarios
✅SOTA on BDD100K, +50% faster!
✅Source code under MIT license

More: https://bit.ly/3LvYGBh
SegNeXt: new SOTA in Semantic Seg.

👉SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯

Highlights:
✅Novel tailored network architecture
✅Spatial attention via multi-scale feats
✅Encoder + conv. better than transformers
✅SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH
🦪StereoVoxelNet: RT Obstacle Detection🦪

👉Novel deep neural approach to detect occupancy from stereo images directly

Highlights:
✅Occupancy voxels via deep learning
✅RT on Jetson-TX2 (-98% CPU of SOTA)
✅Optimization via octrees / sparse conv.
✅Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3
🚜 NeRF-Factory: a NeRF collection 🚜

👉PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

Highlights:
✅NeRF: Project | Paper | Code
✅NeRF++: Paper | Code
✅DVGO: Project | Paper v1/v2 | Code
✅Plenoxels: Project | Paper | Code
✅Mip-NeRF: Project | Paper | Code
✅Mip-NeRF360: Project | Paper | Code
✅Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC
🥶 Lumos by #Nvidia: Relighting Portrait 🥶

👉The new SOTA in relighting without requiring a light stage

😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo https://imaginaire.cc/Lumos/
โค11๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
SURF-GAN: NeRF -> StyleGAN

👉Editable portraits by injecting the NeRF's prior into StyleGAN

😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
🔥#Google just announced "TensorStore"🔥

👉Novel open-source C++ / #Python library for storage/manipulation of high-dim data

😎Review https://bit.ly/3DLwbha
😎Project https://bit.ly/3C4T2TR
😎Code github.com/google/tensorstore
🦠 Motion Transformer for #selfdriving 🦠

👉The 1st place solution for 2022 #waymo "motion prediction" challenge

😎Review https://bit.ly/3f8G4LD
😎Paper arxiv.org/pdf/2209.10033.pdf
😎Code github.com/sshaoshuai/MTR
💹 Image Synthesis @160+ FPS! 💹

👉Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

😎Review https://bit.ly/3r3ZNij
😎Paper arxiv.org/pdf/2206.07695.pdf
😎Project katjaschwarz.github.io/voxgraf
👛 #Nvidia GET3D: #3D generative #AI 👛

👉AI-based Textured 3D meshes with complex topology, rich geometry & hi-fi textures

😎Review https://bit.ly/3SgnT5h
😎Code github.com/nv-tlabs/GET3D
😎Project nv-tlabs.github.io/GET3D/
😎Paper nv-tlabs.github.io/GET3D/assets/paper.pdf
โคโ€๐Ÿ”ฅ7๐Ÿ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
🔥🔥 IDE-3D: source code is out! 🔥🔥

👉Novel, photorealistic, 3D-aware facial generator: source code just released!

😎Review https://bit.ly/3BNrO2C
😎Project mrtornado24.github.io/IDE-3D/
😎Code github.com/MrTornado24/IDE-3D
😎Paper arxiv.org/pdf/2205.15517.pdf
🔥Diffusion Model of Neural Checkpoints🔥

👉Conditional diffusion model trained on millions of checkpoints of a given task/architecture 🤯

😎Review https://bit.ly/3SBR4Qb
😎Project www.wpeebles.com/Gpt
😎Code github.com/wpeebles/G.pt
😎Paper arxiv.org/pdf/2209.12892.pdf
🔥 Semantic VISOR dataset is out! 🔥

👉Segmenting hands / active objects in egocentric video (millions of masks)

😎Review https://bit.ly/3LOBLBv
😎Project epic-kitchens.github.io/VISOR/
😎Paper arxiv.org/pdf/2209.13064.pdf
🥇🥇 Olympic Games in 2028? 🥇🥇

👉In a few years, the fastest runner on earth will not be a human 🥶

😎Review https://bit.ly/3Rme3O3
🔥 SOTA ALERT: new Text-to-Video #AI 🔥

👉#META unveils a novel Text-to-Video (T2V) generation #AI

😎Review https://bit.ly/3E1ZDzG
😎Project https://makeavideo.studio/
😎Paper makeavideo.studio/Make-A-Video.pdf
🔥DreamFusion: Text-to-3D via Diffusion🔥

👉DeepDream-like procedure to create #3D assets just from a given text

😎Review https://bit.ly/3BYY5nu
😎Paper arxiv.org/pdf/2209.14988.pdf
😎Project dreamfusion3d.github.io/gallery.html
🧪 Light Field Neural Rendering 🧪

👉Two-stage transformer capable of non-Lambertian effects (reflection, refraction, translucency)

😎Review https://bit.ly/3CpIFdm
😎Paper arxiv.org/pdf/2112.09687.pdf
😎Project light-field-neural-rendering.github.io
😎Code github.com/google-research/google-research/tree/master/light_field_neural_rendering
🦩Phenaki: Text-to(LOOONG)Video generation🦩

👉Phenaki is an #AI capable of realistic long video synthesis, given a sequence of textual open prompts

😎Review https://bit.ly/3RwUvXx
😎Project phenaki.video/index.h
😎Paper openreview.net/pdf?id=vOEXS39nOF