AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
250 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🎨 RecolorNeRF: #3D Color Editing 🎨

👉INSANE palette-based color editing of NeRF scenes

😎Review https://bit.ly/3GYjhfR
😎Paper arxiv.org/pdf/2301.07958.pdf
😎Project sites.google.com/view/recolornerf
🤯10👍4🤣1
This media is not supported in your browser
VIEW IN TELEGRAM
🏞OmniObject3D: Realistic 3D Dataset 🏞

👉Large-vocabulary #3D dataset for realistic perception, reconstruction & generation

😎Review https://bit.ly/3HlXyjp
😎Paper arxiv.org/pdf/2301.07525.pdf
😎Project omniobject3d.github.io/
🔥9👍41🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍄 BTS: Density Fields from Single View 🍄

👉Volumetric scene representation from a single image in challenging conditions

😎Review https://bit.ly/3wjHDvH
😎Paper arxiv.org/pdf/2301.07668.pdf
😎Project fwmb.github.io/bts/
🔥7👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
StyleGAN-T: unlocking Power of GANs

👉#Nvidia unveils StyleGAN-T to regain competitiveness to GANs vs. Diffusive Models

😎Review https://bit.ly/3HtKxEA
😎Paper arxiv.org/pdf/2301.09515.pdf
😎Project sites.google.com/view/stylegan-t
😎Code github.com/autonomousvision/stylegan-t
🔥9👍4🤯41
This media is not supported in your browser
VIEW IN TELEGRAM
🪀 NeRF in Time, Space and Appearance 🪀

👉From Berkeley k-planes: a white-box model for radiance fields in arbitrary dimensions

😎Review https://bit.ly/3J8GiiS
😎Paper arxiv.org/pdf/2301.10241.pdf
😎Project sarafridov.github.io/K-Planes/
😎Code github.com/sarafridov/K-Planes
👍2🤯1🍾1
Media is too big
VIEW IN TELEGRAM
🔥 Neural Tracking via Weighted OF 🔥

👉The new SOTA in planar neural tracking is INSANE!

😎Review https://bit.ly/404gcDs
😎Paper arxiv.org/pdf/2301.10057.pdf
😎Code github.com/serycjon/WOFT
😎Project cmp.felk.cvut.cz/~serycjon/WOFT
🤯153👍3😱1
This media is not supported in your browser
VIEW IN TELEGRAM
Detecting Vulnerable Pedestrian

👉 BGSU opens a novel pedestrian dataset for vulnerable people

😎Review https://bit.ly/3JjVmu2
😎Paper arxiv.org/pdf/2212.06218.pdf
😎Data github.com/devvansh1997/BGVP
👍61🔥1
🧠 SERENA: LLM for Mental Health Support 🧠

👉Interactive #AI (in "#chatgpt" style) designed for mental health counseling

😎Review https://bit.ly/3wtbW37
😎Paper arxiv.org/pdf/2301.09412.pdf
😎Project https://serena.chat/
👍92
This media is not supported in your browser
VIEW IN TELEGRAM
🐕 MAV3D: #3D Video from Text 🐕

👉#META unveils a novel #AI for generating #3D dynamic videos from text

😎Review https://bit.ly/3XN0zin
😎Paper arxiv.org/pdf/2301.11280.pdf
😎Project make-a-video3d.github.io
🔥8👍3🤣31
This media is not supported in your browser
VIEW IN TELEGRAM
🔥CutLER: Unsupervised Segmentation 🔥

👉Novel paper by #META on detection & instance segmentation without human annotations

😎Review https://bit.ly/3DlFiUG
😎Paper arxiv.org/pdf/2301.11320.pdf
😎Code github.com/facebookresearch/CutLER
😎Project people.eecs.berkeley.edu/~xdwang/projects/CutLER
10👍4🔥4🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
😍 CLIP/GPT3-driven Affective Faces 😍

👉Columbia unveils a neural framework for facial expressions retrieval given the context of the speaker

😎Review https://bit.ly/3HERna0
😎Paper arxiv.org/pdf/2301.10939.pdf
😎Project realtalk.cs.columbia.edu
😎Code github.com/scottgeng00/realtalk
🔥125👍1🥰1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🐦 Physics-inspired Computer Vision 🐦

👉UCLA unveils PhyCV, the first Physics-inspired Computer Vision Library

😎Review https://bit.ly/3HEWozI
😎Code github.com/JalaliLabUCLA/phycv
😎Project photonics.ucla.edu/2022/05/12/jalali-lab-open-sources-phycv-a-physics-inspired-computer-vision-library/
🤯75👍4😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🎷Audio-Visual Semantic Segmentation🎷

👉A novel problem in #AI: pixel-level segmentation of objects that produce sound in the image frame

😎Review https://bit.ly/3wFY6dw
😎Paper arxiv.org/pdf/2301.13190.pdf
😎Project opennlplab.github.io/AVSBench
😎Code github.com/OpenNLPLab/AVSBench
🤯10👍3🔥21😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🚛 Text-driven Video Neural Editing 🚛

👉A novel text-guided video editing with both appearance/shape

😎Review https://bit.ly/3YcfMJO
😎Paper arxiv.org/pdf/2301.13173.pdf
😎Project text-video-edit.github.io/
🔥12👍1
This media is not supported in your browser
VIEW IN TELEGRAM
Mono-STAR: Unified Track/3D

👉Real-time 3D unified framework for semantic fusion, tracking, non-rigid deformation, and topological changes

😎Review https://bit.ly/3Dxvxmx
😎Paper arxiv.org/pdf/2301.13244.pdf
😎Project github.com/changhaonan/Mono-STAR-demo
5👍4🔥41
🛋️🛋️ 100% Accurated #3D Labeling 🛋️🛋️

👉#Amazon unveils a novel tool for fine-grained 3D part labeling. Up to 100% accuracy! Paper only😢

😎Review https://bit.ly/3kYpQHQ
😎Paper https://arxiv.org/pdf/2301.10460.pdf
🤯102👍1
This media is not supported in your browser
VIEW IN TELEGRAM
💧FLOW360: 360° Neural Optical Flow💧

👉 The first perceptually realistic 360° video benchmark dataset + SLOF method for OF tracking

😎Review https://bit.ly/3wMZZoX
😎Paper arxiv.org/pdf/2301.11880.pdf
😎Project https://siamlof.github.io
👍7🤯2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🐓DREAMIX:General Diffusive Video Editor🐓

👉#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/
🤯24😱3👍21