AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI

🌅 Semantics-guided natural synthesis 🌅

👉Alibaba #AI unveils a novel semantics-guided synthesis of natural scenes

😎Review https://bit.ly/4115MVJ
😎Paper arxiv.org/pdf/2302.07224.pdf
😎Project zju3dv.github.io/paintingnature

🦞 SOTA ALERT: YOWOv2 is out! 🦞

👉 The 2nd-gen of YOWO, real-time detection of spatio-temporal actions

😎Review https://bit.ly/3IscY60
😎Paper arxiv.org/pdf/2302.06848v1.pdf
😎Code github.com/yjh0410/YOWOv2

📬 DIVOTrack: crossview MOT dataset 📬

👉 DIVOTrack + CrossMOT: the ultimate solution for MOT in realistic scenarios

😎Review https://bit.ly/3YSFZgL
😎Paper arxiv.org/pdf/2302.07676.pdf
😎Code github.com/shengyuhao/DIVOTrack

🦩 One-Shot Face via LSs of StyleGAN2 🦩

👉 Novel video generation framework with edits, facial motions, deformations & identity

😎Review https://bit.ly/3xuChhF
😎Paper arxiv.org/pdf/2302.07848.pdf
😎Project trevineoorloff.github.io/FaceVideoReenactment_HybridLatents.io/

🌶️ 3D-aware conditional generative AI 🌶️

👉 Pix2Pix3D: 3D-aware conditional generative AI for controllable photorealistic synthesis

😎Review https://bit.ly/3I80MWS
😎Paper arxiv.org/pdf/2302.08509.pdf
😎Project www.cs.cmu.edu/~pix2pix3D
😎Code github.com/dunbar12138/pix2pix3D

🛡️ TPV: Tesla's O-Net competitor 🛡️

👉From Beijing, an open-source approach for vision-centric autonomous driving #3D perception

😎Review https://bit.ly/3lNvVYc
😎Paper arxiv.org/pdf/2302.07817.pdf
😎Code github.com/wzzheng/TPVFormer

🏀 #NBA Mixed Reality is NUTS 🏀

👉The premiere of the streaming app of the #NBA is totally INSANE. A mix of #AI, CG and much more👇

🏀More: https://bit.ly/3IJ3uUp

🫳 Neural Relighting of Hands 🫴

👉#META unveils the first neural relighting for personalized hands in real-time under novel illumination

😎Review https://bit.ly/3SblmKC
😎Paper arxiv.org/pdf/2302.04866.pdf
😎Project sh8.io/#/relightable_hands

VoxFormer: 2D->#3D Voxel ViT

👉#Nvidia VoxFormer: #3D volumetric semantics from 2D images

😎Review https://bit.ly/3Kw9Yab
😎Paper arxiv.org/pdf/2302.12251.pdf
😎Code github.com/NVlabs/VoxFormer

🪞 DisCO: Selfie Correction with 3D-GAN 🪞

👉Snap (et al.) unveils a GAN-based method for correcting distortions in close-up faces

😎Review https://bit.ly/3StGGuX
😎Paper arxiv.org/pdf/2302.12253.pdf
😎Project https://portrait-disco.github.io

⚽️ Vid2Avatar: 3D Avatar from Videos ⚽️

👉Vid2Avatar: detailed 3D avatar from monocular videos in the wild

😎Review https://bit.ly/3ISbceD
😎Paper arxiv.org/pdf/2302.11566.pdf
😎Project moygcc.github.io/vid2avatar
😎Code (soon) github.com/MoyGcc

SLAHMR: 4D People from Clip in-the-Wild

👉UC-Berkeley unveils SLAHMR: novel method to reconstruct global human trajectories from videos

😎Review https://bit.ly/3SzTIaj
😎Paper arxiv.org/pdf/2302.12827.pdf
😎Project vye16.github.io/slahmr/
😎Code github.com/vye16/slahmr

SplineCam: Neural Decision Boundary

👉#META -> SplineCam: a step towards neural visualization / interpretability

😎Review https://bit.ly/3mgoOaH
😎Paper arxiv.org/pdf/2302.12828.pdf
😎Project imtiazhumayun.github.io/splinecam
😎Code github.com/AhmedImtiazPrio/SplineCAM

👑 ControlNet: Conditional Control of Diffusion 👑

👉Controlling Stable Diffusion via conditional inputs like edges, segmentation, keypoints, etc. Extra: a super-nice tutorial.

😎Review https://bit.ly/3YgjrWt
😎Paper arxiv.org/pdf/2302.05543.pdf
😎Code github.com/lllyasviel/ControlNet
😎Tutorial https://github.com/Mikubill/sd-webui-controlnet/discussions/204

🛸 TAU: video traffic analytics via UAVs 🛸

👉 Prince Sultan University unveils TAU: AI-integrated video analytics framework from UAVs' POV

😎Review https://bit.ly/3EQIh8F
😎Paper arxiv.org/pdf/2303.00337.pdf
😎Project github.com/bilel-bj/TAU

🩻 Independent Tokens for 3D Human 🩻

👉Tencent open-sources a novel method to estimate #3D human pose and shape from monocular videos

😎Review https://bit.ly/3Zz0uiH
😎Paper arxiv.org/pdf/2303.00298.pdf
😎Code github.com/yangsenius/INT_HMR_Model
😎Project yangsenius.github.io/INT_HMR_Model/index.html

🌸 3DGP: ImageNet in #3D 🌸

👉 Snap unveils 3DGP: a novel 3D generator with Generic Priors

😎Review https://bit.ly/3KWHUgG
😎Paper arxiv.org/pdf/2303.01416.pdf
😎Project snap-research.github.io/3dgp/
😎Code github.com/snap-research/3dgp

🗺️ S-NeRF: NeRF for Street Views 🗺️

👉S-NeRF: novel view synthesis of streets & foreground moving vehicles jointly

😎Review https://bit.ly/3KZUN9w
😎Paper arxiv.org/pdf/2303.00749.pdf
😎Project ziyang-xie.github.io/s-nerf/
😎Code (soon)

🧱 MobileBrick: #3D object on mobile 🧱

👉#Apple (+Oxford) exploit #LEGO bricks to release the most precise #3D dataset ever. Suitable for mobile #AR

😎Review https://bit.ly/3ZqbiAh
😎Paper arxiv.org/pdf/2303.01932.pdf
😎Project code.active.vision/MobileBrick/
😎Code github.com/ActiveVisionLab/MobileBrick

⚠️ BREAKING: Stability Acquires Init ML ⚠️

👉 Stability AI (#stablediffusion) announces the acquisition of Clipdrop makers 🤯

👉 More: https://bit.ly/3JhKkVO