AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
13 files
1.31K links
All the AI with papers. Fresh daily updates on #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
🌻 Extending Mona Lisa with AI 🌻

👉 A Reddit user extends the Mona Lisa painting with #Photoshop AI. The result is surprising.

😎More https://t.ly/j_2r
🤯20👍5🤩4🔥3😱2🤣2⚡1
🏸 Segment Anything in HQ 🏸

👉HQ-SAM: equips SAM with the ability to accurately segment objects while maintaining its promptable design, efficiency, and zero-shot generalizability

😎Review https://t.ly/GxX5B
😎Paper arxiv.org/pdf/2306.01567.pdf
😎Models github.com/SysCV/SAM-HQ
🔥18👍4🤯1😱1😍1
🌈 Track Everything Everywhere 🌈

👉#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.

😎Review https://t.ly/Krvw
😎Paper arxiv.org/pdf/2306.05422.pdf
😎Project omnimotion.github.io/
😎Demo omnimotion.github.io/#interactive_demo
😎Code github.com/qianqianwang68/omnimotion
🔥23❤5🤯3🤩1💩1
👁️ Scene Five: Through Her Eyes 👁️

👉 #3D scene reconstruction of what a person is observing using only the reflections of their eyes

😎Review https://t.ly/uBO6
😎Paper arxiv.org/pdf/2306.09348.pdf
😎Project https://world-from-eyes.github.io/
🤯28🔥12💩2🤩1
🧿 NeRF-Supervised Deep Stereo 🧿

👉A novel pipeline for training deep stereo networks with no ground truth

😎Review https://t.ly/c7j-
😎Project nerfstereo.github.io/
😎Dataset https://amsacta.unibo.it/id/eprint/7218/
😎Code github.com/fabiotosi92/NeRF-Supervised-Deep-Stereo
😎Paper https://openaccess.thecvf.com/content/CVPR2023/papers/Tosi_NeRF-Supervised_Deep_Stereo_CVPR_2023_paper.pdf
🥰8🤩3❤1👍1💩1😍1
🫣 Text-Guided Adversarial Makeup 🫣

👉Novel facial privacy protection via adversarial latent codes. Makeup vs Face Recognition.

😎Review https://t.ly/pBCP
😎Paper arxiv.org/pdf/2306.10008.pdf
😎Code github.com/fahadshamshad/Clip2Protect
❤6👍1🔥1🥰1💩1
🦷 Few-Shot Geometry-Aware Keypoints 🦷

👉UBC (+Flawless AI) unveils the new SOTA in semantic keypoint localization. Suitable for faces, animals, cars, mouths, teeth & more

😎Review https://t.ly/-0qN
😎Paper arxiv.org/pdf/2303.17216.pdf
😎Project xingzhehe.github.io/FewShot3DKP/
🤯10👍4❤2⚡2👏2🤩2🔥1
🚔 Fooling Neural Forensic Classifiers 🚔

👉Adversarial faces that fool forensic classifiers while remaining undetectable to humans

😎Review https://t.ly/33Cc
😎Paper arxiv.org/pdf/2306.13091.pdf
😎Project koushiksrivats.github.io/face_attribute_attack
😎Code github.com/koushiksrivats/face_attribute_attack
😢6❤4👏2😱2🍾2👍1🤯1😍1
đŸĨ PanoHead: 3D Full-Head Synthesis đŸĨ

👉#ByteDance (+UW-M) unveils PanoHead: 360â—Ļ view-consistent portraits from a single-view image

😎Review https://t.ly/MrLNR
😎Paper arxiv.org/pdf/2303.13071.pdf
😎Project sizhean.github.io/panohead
😎Code github.com/sizhean/panohead
🔥7❤4🤯3😱1
🔮SAM-PT: Segment Anything+Tracking🔮

👉SAM-PT is the first method to utilize sparse point propagation for Video Object Segmentation (VOS).

😎Review https://t.ly/QLMG
😎Paper arxiv.org/pdf/2307.01197.pdf
😎Project www.vis.xyz/pub/sam-pt/
😎Code github.com/SysCV/sam-pt
🔥14❤7🤯3👍1😱1
🪩 DISCO: Human Dance Generation 🪩

👉NTU (+ #Microsoft) unveils DISCO: a big step towards human dance generation.

😎Review https://t.ly/cNGX
😎Paper arxiv.org/pdf/2307.00040.pdf
😎Project disco-dance.github.io/
😎Code github.com/Wangt-CN/DisCo
🔥13🥰4😍2⚡1👍1🍾1
🛣️ STAR: 3D tracking w/ attention paradigm 🛣️

👉#Mercedes STAR: end-to-end 3D object tracking that follows the tracking-by-attention paradigm

😎Review https://t.ly/JoGj
😎Paper arxiv.org/pdf/2306.17602.pdf
😎Project simondoll.github.io/publications/star_track
👍14🔥1🥰1👏1
🍡 Text2Cinemagraphs: Cinemagraph from text 🍡

👉CMU (+ #Snap) unveils a fully automated method for creating cinemagraphs from text descriptions

😎Review https://t.ly/BwZs6
😎Paper arxiv.org/pdf/2307.03190.pdf
😎Project text2cinemagraph.github.io/website
😎Code github.com/text2cinemagraph/text2cinemagraph
❤12🤯3😱1🤩1
🔥 Test-Time Training on fire 🔥

👉Extending Test-Time Training (TTT) to the streaming setting. Suitable for panoptic segmentation, instance segmentation & colorization.

😎Review https://t.ly/eZYA
😎Paper arxiv.org/pdf/2307.05014.pdf
😎Project https://video-ttt.github.io/
😎Code github.com/renwang435/video-ttt-release
🔥10👍3⚡1🤯1
🃏 Deepfake via casual self-scan 🃏

👉TAU presents a novel approach to reenact an identity using only a casual self-scan

😎Review https://t.ly/9T8Wi
😎Paper arxiv.org/pdf/2307.06307.pdf
😎Project arielazary.github.io/PGR
🤯7👍6❤5🔥1😱1
🎪 Extreme Human Pose Estimation 🎪

👉RePoGen: novel synthetic data generator of extreme/realistic poses of humans

😎Review https://t.ly/ecBvM
😎Paper arxiv.org/pdf/2307.06737.pdf
😎Project mirapurkrabek.github.io/RePoGen-paper
😎Code github.com/MiraPurkrabek/RePoGen
🔥12👍2👏1🤯1😱1
💡 DATID-3D: Text-to-3D Generation 💡

👉 A novel domain adaptation method for 3D via text-to-image diffusion. 🤗-Demo available!

😎Review https://t.ly/TCL-B
😎Paper arxiv.org/pdf/2211.16374.pdf
😎Project gwang-kim.github.io/datid_3d/
😎Code github.com/gwang-kim/DATID-3D
🤗 huggingface.co/spaces/gwang-kim/DATID-3D
😎Colab colab.research.google.com/drive/1e9NSVB7x_hjz-nr4K0jO4rfTXILnNGtA?usp=sharing
🤯5
🧯 Neural Focal Modulation VAR 🧯

👉A novel architecture for video recognition that models both local/global context

😎Review https://t.ly/rF_fk
😎Paper arxiv.org/pdf/2307.06947.pdf
😎Project talalwasim.github.io/Video-FocalNets
😎Code github.com/TalalWasim/Video-FocalNets
🔥8⚡1👏1🤩1
🐈 Gen-AI as representation learner 🐈

👉DreamTeacher: a novel self-supervised feature representation learning framework that uses generative networks to pre-train downstream image backbones

😎Review https://t.ly/RL8iG
😎Paper arxiv.org/pdf/2307.07487.pdf
😎Project research.nvidia.com/labs/toronto-ai/DreamTeacher
🔥9👍2🤯1
☔ #SelfDriving? It's all about weather! ☔

👉Novel self-supervised monocular depth estimation (MDE) method that handles adverse weather in real-world autonomous driving

😎Review https://t.ly/tcLQW
😎Paper arxiv.org/pdf/2307.08357.pdf
😎Project kieran514.github.io/Robust-Depth-Project/
❤7👍3🤯1😱1