AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
248 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
😍 CLIP/GPT3-driven Affective Faces 😍

👉Columbia unveils a neural framework for facial expressions retrieval given the context of the speaker

😎Review https://bit.ly/3HERna0
😎Paper arxiv.org/pdf/2301.10939.pdf
😎Project realtalk.cs.columbia.edu
😎Code github.com/scottgeng00/realtalk
🔥125👍1🥰1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🐦 Physics-inspired Computer Vision 🐦

👉UCLA unveils PhyCV, the first Physics-inspired Computer Vision Library

😎Review https://bit.ly/3HEWozI
😎Code github.com/JalaliLabUCLA/phycv
😎Project photonics.ucla.edu/2022/05/12/jalali-lab-open-sources-phycv-a-physics-inspired-computer-vision-library/
🤯75👍4😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🎷Audio-Visual Semantic Segmentation🎷

👉A novel problem in #AI: pixel-level segmentation of objects that produce sound in the image frame

😎Review https://bit.ly/3wFY6dw
😎Paper arxiv.org/pdf/2301.13190.pdf
😎Project opennlplab.github.io/AVSBench
😎Code github.com/OpenNLPLab/AVSBench
🤯10👍3🔥21😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🚛 Text-driven Video Neural Editing 🚛

👉A novel text-guided video editing with both appearance/shape

😎Review https://bit.ly/3YcfMJO
😎Paper arxiv.org/pdf/2301.13173.pdf
😎Project text-video-edit.github.io/
🔥12👍1
This media is not supported in your browser
VIEW IN TELEGRAM
Mono-STAR: Unified Track/3D

👉Real-time 3D unified framework for semantic fusion, tracking, non-rigid deformation, and topological changes

😎Review https://bit.ly/3Dxvxmx
😎Paper arxiv.org/pdf/2301.13244.pdf
😎Project github.com/changhaonan/Mono-STAR-demo
5👍4🔥41
🛋️🛋️ 100% Accurated #3D Labeling 🛋️🛋️

👉#Amazon unveils a novel tool for fine-grained 3D part labeling. Up to 100% accuracy! Paper only😢

😎Review https://bit.ly/3kYpQHQ
😎Paper https://arxiv.org/pdf/2301.10460.pdf
🤯102👍1
This media is not supported in your browser
VIEW IN TELEGRAM
💧FLOW360: 360° Neural Optical Flow💧

👉 The first perceptually realistic 360° video benchmark dataset + SLOF method for OF tracking

😎Review https://bit.ly/3wMZZoX
😎Paper arxiv.org/pdf/2301.11880.pdf
😎Project https://siamlof.github.io
👍7🤯2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🐓DREAMIX:General Diffusive Video Editor🐓

👉#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/
🤯24😱3👍21
This media is not supported in your browser
VIEW IN TELEGRAM
🦚 MOSE: coMplex video Object SEgmentation 🦚

👉Novel Dataset for VOS is out! SOTA method on DAVIS is only 59.4% on MOSE

😎Review https://bit.ly/40yzSzW
😎Paper arxiv.org/pdf/2302.01872.pdf
😎Project henghuiding.github.io/MOSE/
😎Code github.com/henghuiding/MOSE-api
7👍2🔥2
This media is not supported in your browser
VIEW IN TELEGRAM
🌘 Gen-1: next-gen Generative #AI 🌘

👉#Runway unveils Gen-1: the next step forward for Generative AI. Registration available for beta -> hurry up!

😎Review https://bit.ly/3YqQYh8
😎Paper arxiv.org/pdf/2302.03011.pdf
😎Project https://research.runwayml.com/gen1
🤯10😱31👍1🔥1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🗿DirectMHP: Multi-Head Pose Estimation🗿

👉Novel E2E multi-person head pose estimation (MPHPE) under full-range angles

😎Review https://bit.ly/3HJubXg
😎Paper arxiv.org/pdf/2302.01110.pdf
😎Code github.com/hnuzhy/DirectMHP
🔥13👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 LEGO-Net: Objects in Rooms 🧱

👉Transformer-based iterative method for rearrangement of objects in messy rooms

😎Review https://bit.ly/3HR0fs6
😎Paper arxiv.org/pdf/2301.09629.pdf
😎Project ivl.cs.brown.edu/#/projects/lego-net
🔥11🤯4
This media is not supported in your browser
VIEW IN TELEGRAM
🎃 In-N-Out: 3D-aware OOD video editing 🎃

👉Novel 3D-aware video editing able to manipulate OOD objects (e.g. heavy makeup, accessories)

😎Review https://bit.ly/3jN0CMu
😎Paper arxiv.org/pdf/2302.04871.pdf
😎Project https://in-n-out-3d.github.io
🔥42🤯2👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🥸 MEGANE: Generative Morphable Eyeglass 🥸

👉#META unveils the most advanced #3D compositional morphable AI for eyeglasses (HD geometry/photometric interaction)

😎Review https://bit.ly/3jOWifu
😎Paper arxiv.org/pdf/2302.04868.pdf
😎Project junxuan-li.github.io/megane
🔥9🤯3👍2🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
💘 3D-aware Blending with NeRF 💘

👉Novel 3D-aware blending method via generative NeRFs

😎Review https://bit.ly/3lBEJA2
😎Paper arxiv.org/pdf/2302.06608.pdf
😎Project blandocs.github.io/blendnerf
😎Code github.com/naver-ai/BlendNeRF
8
This media is not supported in your browser
VIEW IN TELEGRAM
🌅 Semantics-guided natural synthesis 🌅

👉Alibaba #AI unveils a novel semantics-guided synthesis of natural scenes

😎Review https://bit.ly/4115MVJ
😎Paper arxiv.org/pdf/2302.07224.pdf
😎Project zju3dv.github.io/paintingnature
👍5🔥1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦞 SOTA ALERT: YOWOv2 is out! 🦞

👉 The 2nd-gen of YOWO, real-time detection of spatio-temporal actions

😎Review https://bit.ly/3IscY60
😎Paper arxiv.org/pdf/2302.06848v1.pdf
😎Code github.com/yjh0410/YOWOv2
🔥17👍2
This media is not supported in your browser
VIEW IN TELEGRAM
📬 DIVOTrack: crossview MOT dataset 📬

👉 DIVOTrack + CrossMOT: the ultimate solution for MOT in realistic scenario

😎Review https://bit.ly/3YSFZgL
😎Paper arxiv.org/pdf/2302.07676.pdf
😎Code github.com/shengyuhao/DIVOTrack
🔥6👍2🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦩 One-Shot Face via LSs of StyleGAN2 🦩

👉 Novel video generation framework with edits, facial motions, deformations & identity

😎Review https://bit.ly/3xuChhF
😎Paper arxiv.org/pdf/2302.07848.pdf
😎Project trevineoorloff.github.io/FaceVideoReenactment_HybridLatents.io/
🤯3😱21👍1