AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🌱 Segment Everything Everywhere 🌱

πŸ‘‰ Segmenting everything using visual/language prompts (BBs, scribbles, text & audio)

😎Review https://bit.ly/3LEiOmx
😎Paper arxiv.org/pdf/2304.06718.pdf
😎Demo huggingface.co/spaces/xdecoder/SEEM
😎Code github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
πŸ”₯13❀4🀯1🀩1
πŸ¦’ Look mom, I'm a giraffe πŸ¦’

πŸ‘‰ A patent to transpose adversarial patches onto a knitted fabric. Be undetectable or associated with incorrect category such as "animal" (giraffe, zebra, etc)

😎 More: https://bit.ly/3LzjSGV
❀20πŸ‘4🀩4πŸ”₯3πŸ’©3πŸ‘1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🐊 RelPose++: SOTA 6D from 2-8 pics 🐊

πŸ‘‰CMU unveils a novel neural method for 6D camera poses from only 2-8 images

😎Review https://bit.ly/42ioJ6K
😎Paper arxiv.org/pdf/2305.04926.pdf
😎Project amyxlase.github.io/relpose-plus-plus
😎Code github.com/amyxlase/relpose-plus-plus
πŸ”₯16🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦• 6D Non-Prehensile Manipulation πŸ¦•

πŸ‘‰#META (+CMU) unveils HACMan, novel 6D non-prehensile manipulation of objects

😎Review https://bit.ly/3NP1jl1
😎Paper arxiv.org/pdf/2305.03942.pdf
😎Project hacman-2023.github.io
πŸ‘6πŸ”₯4🀯3😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ›Έ Virtual Occlusions in #AR πŸ›Έ

πŸ‘‰Niantic (#pokemongo) on a novel approach for virtual assets to appear β€˜sitting among’ the real world objects

😎Review https://bit.ly/3o04wn6
😎Paper arxiv.org/pdf/2305.07014.pdf
😎Project nianticlabs.github.io/implicit-depth
😎Code github.com/nianticlabs/implicit-depth
πŸ”₯11🀯5πŸ‘3⚑1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍿 De-Aging Harrison Ford via SD 🍿

πŸ‘‰Stable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπŸ‘‡

😎 More: https://bit.ly/41EzaQK
🀯19πŸ”₯9πŸ‘6πŸ’©3⚑1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ° #3D Auto-Reconstruction πŸͺ°

πŸ‘‰AutoRecon: automated discovery & reconstruction of objects from multi-view pics.

😎Review https://bit.ly/3MxI0f4
😎Paper arxiv.org/pdf/2305.08810.pdf
😎Project zju3dv.github.io/autorecon/
😎Code github.com/zju3dv/AutoRecon
πŸ”₯11❀4🀯3πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘š Multi-Layered 3D Garments Animation πŸ‘š

πŸ‘‰S-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind

😎Review https://bit.ly/435b42F
😎Paper arxiv.org/pdf/2305.10418.pdf
😎Project mmlab-ntu.github.io/project/layersnet
πŸ”₯6😱2❀1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🎫 100% Mask-Free VIS 🎫

πŸ‘‰ETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.

😎Review https://bit.ly/3Wg7CQB
😎Paper arxiv.org/pdf/2303.15904.pdf
😎Project www.vis.xyz/pub/maskfreevis/
😎Code github.com/SysCV/maskfreevis
πŸ”₯6πŸ‘4🀯2❀1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€„ Drag-GAN: user-friendly image-manipulation πŸ€„

πŸ‘‰ Manual deforming of (real and generated) images over pose, shape, expression and layout.

😎Review https://bit.ly/3BFyXlR
😎Paper arxiv.org/pdf/2305.10973.pdf
😎Project vcai.mpi-inf.mpg.de/projects/DragGAN
😎Code github.com/XingangPan/DragGAN
πŸ”₯34🀯18❀6πŸ‘4😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ—ΊοΈ AI-generated stereotypical men πŸ—ΊοΈ

πŸ‘‰A thread about generating stereotypical person from 15 countries all around the world. And yes, Italian love Pizza.

😎 More https://bit.ly/3oo0t4c
🀣6❀3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍢 AVOS Multiscale Encoder-Decoder ViT 🍢

πŸ‘‰ MED-VT, world's first Multiscale Encoder Decoder Video Transformer for AVOS

😎Review https://bit.ly/3MohFi1
😎Paper arxiv.org/pdf/2304.05930.pdf
😎Project rkyuca.github.io/medvt
😎Code github.com/rkyuca/medvt
πŸ‘13πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
🌊 Neural Dynamic Image-Based Rendering 🌊

πŸ‘‰ DynIBaR: synthesizing novel views from monocular video depicting a complex dynamic scene.

😎Review https://t.ly/90Kw
😎Paper arxiv.org/pdf/2211.11082.pdf
😎Project https://dynibar.github.io/
😎Code github.com/google/dynibar
❀9πŸ‘3πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦁 Open Semantic Segmentation 🦁

πŸ‘‰SSSegmentation: open source supervised semantic segmentation toolbox based on #PyTorch

😎Review https://t.ly/ZE9q
😎Paper arxiv.org/pdf/2305.17091.pdf
😎Code github.com/SegmentationBLWX/sssegmentation
πŸ”₯10❀4⚑1πŸ‘1🀯1🀩1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŽ—οΈ 4D Humans with Transformers πŸŽ—οΈ

πŸ‘‰Novel approach to reconstruct and track humans (even in unusual poses)

😎Review https://t.ly/XGv_
😎Paper arxiv.org/pdf/2305.20091.pdf
😎Project shubham-goel.github.io/4dhumans/#
😎Code github.com/shubham-goel/4D-Humans
🀯10πŸ‘7πŸ”₯5❀2⚑1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ—½ Neuralangelo Digital Twins. INSANEπŸ—½

πŸ‘‰ A novel framework from #Nvidia for Hi-Fi 3D Digital twins.

😎Review https://t.ly/rxoF4
😎Project research.nvidia.com/labs/dir/neuralangelo
😎Paper research.nvidia.com/labs/dir/neuralangelo/paper.pdf
πŸ”₯15πŸ‘4🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦜 ColorDiffuser: Text-to-Video Colorization 🦜

πŸ‘‰HK University unveils ColorDiffuser: adapting pre-trained text-to-image latent diffusion model for video colorization

😎Review https://t.ly/XGv_
😎Paper arxiv.org/pdf/2306.01732.pdf
😎Project colordiffuser.github.io/
😎Code github.com/ColorDiffuser/ColorDiffuser
🀯8❀2🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🌻 Extending Mona Lisa with AI 🌻

πŸ‘‰ A guy on Reddit extends Mona Lisa Painting with #Photoshop AI. The result is surprising.

😎More https://t.ly/j_2r
🀯20πŸ‘5🀩4πŸ”₯3😱2🀣2⚑1
This media is not supported in your browser
VIEW IN TELEGRAM
🏸 Segment Anything in HQ 🏸

πŸ‘‰HQ-SAM: SAM with the ability to accurately segment objects, maintaining promptable design, efficiency, zero-shot generalizability

😎Review https://t.ly/GxX5B
😎Paper arxiv.org/pdf/2306.01567.pdf
😎Models github.com/SysCV/SAM-HQ
πŸ”₯18πŸ‘4🀯1😱1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🌈 Track Everything Everywhere 🌈

πŸ‘‰#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.

😎Review https://t.ly/Krvw
😎Paper arxiv.org/pdf/2306.05422.pdf
😎Project omnimotion.github.io/
😎Demo omnimotion.github.io/#interactive_demo
😎Code github.com/qianqianwang68/omnimotion
πŸ”₯23❀5🀯3🀩1πŸ’©1