AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
👗DreamPose: Fashion I-2-V Diffusion👗

👉 Turning fashion photos into realistic videos via a driving pose sequence

😎Review https://bit.ly/3AdNtAN
😎Paper arxiv.org/pdf/2304.06025.pdf
😎Code github.com/johannakarras/DreamPose
😎Project grail.cs.washington.edu/projects/dreampose
🥦 Zip-NeRF: the Anti-Aliasing NeRF 🥦

👉#Google unveils a new version of NeRF that fixes the aliasing problem while training 22x faster than SOTA.

😎Review https://bit.ly/3L1hZ6M
😎Paper arxiv.org/pdf/2304.06706.pdf
😎Project https://jonbarron.info/zipnerf
🔥 ALERT: Stable Diffusion XL is out! 🔥

👉SDXL, the new text-to-image generative AI by Stability.AI. Up to 1024x1024 resolution, for free.

😎More https://bit.ly/41wrh0j
🪬 META's Animated Drawings is out! 🪬

👉#META unveils an easy-to-use method for animating human-like figures drawn by children.

😎Review https://bit.ly/3mGeQQv
😎Paper arxiv.org/pdf/2303.12741.pdf
😎Project fairanimateddrawings.com
🌻DDS: diffusive text-based image editing🌻

👉Google unveils a novel text-based image editing method that modifies an input image towards a text description.

😎Review https://bit.ly/3L52UBl
😎Paper arxiv.org/pdf/2304.07090.pdf
😎Project delta-denoising-score.github.io
🪅 Inpaint Anything: Segmentation + Inpainting 🪅

👉Remove / Fill / Replace anything (also via prompt). "Inpaint Anything", a new paradigm of "clicking & filling"

😎Review https://bit.ly/43JNREE
😎Paper arxiv.org/pdf/2304.06790.pdf
😎Code github.com/geekyutao/Inpaint-Anything
πŸ‘16🀯8❀3😒1
Hi friends,
right now I'm flying to NY for a business trip!

👉 Is anyone studying/working @NYU? I'd love to visit the campus and (if possible) attend a few lessons on AI/CV/MATH on Monday (or this Friday)

Send me a DM -> @argovision
🔥 Track Anything: SAM-powered tracking 🔥

👉 SUSTech VIP Lab proposes TAM, a "novel" video tracker powered by SAM

😎Review https://bit.ly/44jwI4W
😎Paper arxiv.org/pdf/2304.11968.pdf
😎Code github.com/gaomingqi/Track-Anything
🌱 Segment Everything Everywhere 🌱

👉 Segmenting everything using visual/language prompts (bounding boxes, scribbles, text & audio)

😎Review https://bit.ly/3LEiOmx
😎Paper arxiv.org/pdf/2304.06718.pdf
😎Demo huggingface.co/spaces/xdecoder/SEEM
😎Code github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
🦒 Look mom, I'm a giraffe 🦒

👉 A patent for transposing adversarial patches onto knitted fabric, so that the wearer goes undetected or is assigned to an incorrect category such as "animal" (giraffe, zebra, etc.)

😎 More: https://bit.ly/3LzjSGV
RelPose++: SOTA 6D from 2-8 pics

👉CMU unveils a novel neural method for estimating 6D camera poses from only 2-8 images

😎Review https://bit.ly/42ioJ6K
😎Paper arxiv.org/pdf/2305.04926.pdf
😎Project amyxlase.github.io/relpose-plus-plus
😎Code github.com/amyxlase/relpose-plus-plus
🦕 6D Non-Prehensile Manipulation 🦕

👉#META (+CMU) unveils HACMan, a novel approach to 6D non-prehensile manipulation of objects

😎Review https://bit.ly/3NP1jl1
😎Paper arxiv.org/pdf/2305.03942.pdf
😎Project hacman-2023.github.io
πŸ‘6πŸ”₯4🀯3😱1
🛸 Virtual Occlusions in #AR 🛸

👉Niantic (#pokemongo) presents a novel approach that makes virtual assets appear to be 'sitting among' real-world objects

😎Review https://bit.ly/3o04wn6
😎Paper arxiv.org/pdf/2305.07014.pdf
😎Project nianticlabs.github.io/implicit-depth
😎Code github.com/nianticlabs/implicit-depth
🍿 De-Aging Harrison Ford via SD 🍿

👉Stable Diffusion for Hollywood: a preview of the next autotune of the entertainment industry. A discussion👇

😎 More: https://bit.ly/41EzaQK
🪰 #3D Auto-Reconstruction 🪰

👉AutoRecon: automated discovery & reconstruction of objects from multi-view pics.

😎Review https://bit.ly/3MxI0f4
😎Paper arxiv.org/pdf/2305.08810.pdf
😎Project zju3dv.github.io/autorecon/
😎Code github.com/zju3dv/AutoRecon
👚 Multi-Layered 3D Garments Animation 👚

👉S-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind

😎Review https://bit.ly/435b42F
😎Paper arxiv.org/pdf/2305.10418.pdf
😎Project mmlab-ntu.github.io/project/layersnet
🎫 100% Mask-Free VIS 🎫

👉ETH Zurich unveils MaskFreeVIS: novel high-performing video instance segmentation (VIS) without any mask annotations.

😎Review https://bit.ly/3Wg7CQB
😎Paper arxiv.org/pdf/2303.15904.pdf
😎Project www.vis.xyz/pub/maskfreevis/
😎Code github.com/SysCV/maskfreevis
🀄 Drag-GAN: user-friendly image-manipulation 🀄

👉 Manually deform (real and generated) images to control pose, shape, expression and layout.

😎Review https://bit.ly/3BFyXlR
😎Paper arxiv.org/pdf/2305.10973.pdf
😎Project vcai.mpi-inf.mpg.de/projects/DragGAN
😎Code github.com/XingangPan/DragGAN
πŸ—ΊοΈ AI-generated stereotypical men πŸ—ΊοΈ

πŸ‘‰A thread about generating stereotypical person from 15 countries all around the world. And yes, Italian love Pizza.

😎 More https://bit.ly/3oo0t4c
AVOS Multiscale Encoder-Decoder ViT

👉 MED-VT, the world's first Multiscale Encoder-Decoder Video Transformer for AVOS

😎Review https://bit.ly/3MohFi1
😎Paper arxiv.org/pdf/2304.05930.pdf
😎Project rkyuca.github.io/medvt
😎Code github.com/rkyuca/medvt
πŸ‘13πŸ₯°1