AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
250 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ A Survey on Diffusion Models πŸ”₯

πŸ‘‰A comprehensive review of denoising diffusion models in #computervision 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Overview on diffusion models
βœ…Hot trend for the generative AI
βœ…A multi-perspective categorization
βœ…Current limitations / new directions

More: https://bit.ly/3RYG5zP
❀5πŸ‘3πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‰#AI finds where IG photos are takenπŸ‰

πŸ‘‰Brilliant work of Depoorter, Belgium artist that handles #privacy, #AI & #socialmedia

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Recorded open cameras for weeks
βœ…Scraped all #Instagram photos
βœ…Matching Instagram vs. footage

More: https://bit.ly/3eL5dfc
😱18πŸ‘13πŸ₯°2
This media is not supported in your browser
VIEW IN TELEGRAM
🈯SAMURAI: in-the-wild Shape/Material🈯

πŸ‘‰#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Parametrization for varying distances
βœ…Camera multiplex optimization
βœ…Posterior scaling of input images
βœ…Explicit meshes extraction with BRDF
βœ…Code/data soon available ->#NeurIPS

More: https://bit.ly/3BKWgf3
πŸ‘8πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🟨 Lang<->Pics in 100+ Languages 🟨

πŸ‘‰#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…PaLI: Pathways Lang & Image model
βœ…Answering, captioning, reasoning, etc
βœ…From Eng. to 109 lang. understanding
βœ…The new SOTA on several datasets

More: https://bit.ly/3QMslHC
πŸ”₯6πŸ‘1πŸ’―1
This media is not supported in your browser
VIEW IN TELEGRAM
🍐PeRFception: Largest IR Dataset🍐

πŸ‘‰#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…POSTECH + NVIDIA + Caltech = 🀯
βœ…Size: -96.4% from original dataset!
βœ…2D/3D image/object class/semantic
βœ…Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA
❀9❀‍πŸ”₯1πŸ‘1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🐸 CHARL-E: Stable Diffusion in 1 click 🐸

πŸ‘‰CHARL-E packages Stable Diffusion into a simple app.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…No setup, dependencies, or internet
βœ…Images with 1-click on #macbook
βœ…Suitable only for M1/M2 processor
βœ…Source code under MIT license

More: https://bit.ly/3xv2z3G
πŸ”₯11πŸ‘3❀‍πŸ”₯1❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‹YOLOPv2: Better Driving PerceptionπŸ‹

πŸ‘‰YOLOPv2: simultaneous object, road segmentation & lane detection

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…E2E perception net with better backbone
βœ…Efficient ELAN for reasonable memory
βœ…Stability for adapting to scenarios
βœ…SOTA on BDD100K, +50% faster!
βœ…Source code under MIT license

More: https://bit.ly/3LvYGBh
πŸ”₯12
🍈SegNeXt: new SOTA in Semantic Seg.🍈

πŸ‘‰SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel tailored network architecture
βœ…Spatial attention via multi-scale feats
βœ…Encoder + conv. better than transformers
βœ…SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH
πŸ”₯9πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦ͺStereoVoxelNet: RT Obstacles DetectionπŸ¦ͺ

πŸ‘‰Novel deep neural approach to detect occupancy from stereo images directly

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Occupancy voxels via deep learning
βœ…RT on Jetson-TX2 (-98% CPU of SOTA)
βœ…Optimization via octrees / sparse conv.
βœ…Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3
πŸ‘10πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
🚜 NeRF-Factory: a NeRF collection 🚜

πŸ‘‰PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF: Project | Paper | Code
βœ…NeRF++: Paper | Code
βœ…DVGO: Project | Paper v1/v2 | Code
βœ…Plenoxels: Project | Paper | Code
βœ…Mip-NeRF: Project | Paper | Code
βœ…Mip-NeRF360: Project | Paper | Code
βœ…Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC
πŸ‘7🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯Ά Lumos by #Nvidia: Relighting Portrait πŸ₯Ά

πŸ‘‰The new SOTA in relighting without requiring a light stage

😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo https://imaginaire.cc/Lumos/
❀11πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🍜 SURF-GAN: NeRF - >StyleGAN 🍜

πŸ‘‰ Editable portraits by injecting the NeRF's prior into StyleGAN

😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
πŸ‘4❀2❀‍πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯#Google just announced "TensorStore"πŸ”₯

πŸ‘‰Novel open-source C++ / #Python library for storage/manipulation of high-dim data

😎Review https://bit.ly/3DLwbha
😎Project https://bit.ly/3C4T2TR
😎Code github.com/google/tensorstore
πŸ”₯14πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🦠 Motion Transformer for #selfdriving 🦠

πŸ‘‰The 1st place solution for 2022 #waymo "motion prediction" challenge

😎Review https://bit.ly/3f8G4LD
😎Paper arxiv.org/pdf/2209.10033.pdf
😎Code github.com/sshaoshuai/MTR
πŸ”₯17πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’Ή Image Synthesis @160+ FPS! πŸ’Ή

πŸ‘‰Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

😎Review https://bit.ly/3r3ZNij
😎Paper arxiv.org/pdf/2206.07695.pdf
😎Project katjaschwarz.github.io/voxgraf
πŸ‘3🀯2πŸ”₯1πŸ’―1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘› #Nvidia GET3D: #3D generative #AI πŸ‘›

πŸ‘‰AI-based Textured 3D meshes with complex topology, rich geometry & hi-fi textures

😎Review https://bit.ly/3SgnT5h
😎Code github.com/nv-tlabs/GET3D
😎Project nv-tlabs.github.io/GET3D/
😎Paper nv-tlabs.github.io/GET3D/assets/paper.pdf
❀‍πŸ”₯7πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯ IDE-3D: source code is out! πŸ”₯πŸ”₯

πŸ‘‰Novel, photorealistic, 3D-aware facial generator: source code just released!

😎Review https://bit.ly/3BNrO2C
😎Project mrtornado24.github.io/IDE-3D/
😎Code github.com/MrTornado24/IDE-3D
😎Paper arxiv.org/pdf/2205.15517.pdf
🀯8πŸ‘5πŸ”₯3🀩3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Diffusion Model of Neural CheckpointsπŸ”₯

πŸ‘‰Conditional diffusion model on Millions of checkpoints of a given task/architecture 🀯

😎Review https://bit.ly/3SBR4Qb
😎Project www.wpeebles.com/Gpt
😎Code github.com/wpeebles/G.pt
😎Paper arxiv.org/pdf/2209.12892.pdf
🀯5❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ Semantic VISOR dataset is out! πŸ”₯

πŸ‘‰Segmenting hands / active objects in egocentric video (millions masks)

😎Review https://bit.ly/3LOBLBv
😎Project epic-kitchens.github.io/VISOR/
😎Paper arxiv.org/pdf/2209.13064.pdf
🀯8πŸ”₯4πŸ‘1