AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅซ Plug 'n' play self-checkout ๐Ÿฅซ

๐Ÿ‘‰#Google's new shelf-checking #AI: recognizing billions of products, even purchased/moved

๐Ÿ˜ŽReview https://bit.ly/3J58hQe
๐Ÿ˜ŽNews https://cloud.google.com/blog/transform/nrf-2023-google-cloud-big-show-big-moment-hybrid-retail
๐Ÿคฏ8๐Ÿ‘7
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŠ GLIGEN: Grounded T2I Diffusion ๐ŸŠ

๐Ÿ‘‰New (insane๐Ÿคฏ) SOTA in zero-shot layout-to-image generation. Demo available!

๐Ÿ˜ŽReview https://bit.ly/3J0rnHw
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07093.pdf
๐Ÿ˜ŽCode github.com/gligen/GLIGEN
๐Ÿ˜ŽDemo dev.hliu.cc/gligen_mirror2/
๐Ÿ˜ŽProject gligen.github.io/
๐Ÿ”ฅ9๐Ÿพ2โค1โšก1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงค Handy: Hands "Pipeline" by #Shopify ๐Ÿงค

๐Ÿ‘‰Shopify open-sourced Handy: hand gestures via #metaquest headsets -> into #Blender

๐Ÿ˜ŽReview https://bit.ly/3Wpkpi2
๐Ÿ˜ŽProject github.com/Shopify/handy
๐Ÿ˜ŽDemo diegomacario.github.io/Hands-In-The-Web/public/index.html
๐Ÿ”ฅ11๐Ÿ‘2๐Ÿ‘2๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ€ NeRF2NeRF: 3D registration on NeRFs ๐Ÿ€

๐Ÿ‘‰A novel 3D registration that operates directly on NeRFs

๐Ÿ˜ŽReview https://bit.ly/3ZRgz4a
๐Ÿ˜ŽPaper arxiv.org/pdf/2211.01600.pdf
๐Ÿ˜ŽCode github.com/nerf2nerf
๐Ÿ˜ŽProject https://nerf2nerf.github.io/
๐Ÿ˜ŽDataset https://drive.google.com/drive/folders/1jNpwAv1T1ntjIHUMJ1wABePA2Z8_nRRQ
๐Ÿ‘9๐Ÿ”ฅ5
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽจ RecolorNeRF: #3D Color Editing ๐ŸŽจ

๐Ÿ‘‰INSANE palette-based color editing of NeRF scenes

๐Ÿ˜ŽReview https://bit.ly/3GYjhfR
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07958.pdf
๐Ÿ˜ŽProject sites.google.com/view/recolornerf
๐Ÿคฏ10๐Ÿ‘4๐Ÿคฃ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸžOmniObject3D: Realistic 3D Dataset ๐Ÿž

๐Ÿ‘‰Large-vocabulary #3D dataset for realistic perception, reconstruction & generation

๐Ÿ˜ŽReview https://bit.ly/3HlXyjp
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07525.pdf
๐Ÿ˜ŽProject omniobject3d.github.io/
๐Ÿ”ฅ9๐Ÿ‘4โค1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘‘ OnePose++: One-Shot Pose ๐Ÿ‘‘

๐Ÿ‘‰Open-source (insane) keypoint-free pose pipeline for #AR

๐Ÿ˜ŽReview https://bit.ly/3kF0BdG
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07673.pdf
๐Ÿ˜ŽProject zju3dv.github.io/onepose_plus_plus
๐Ÿ˜ŽCode github.com/zju3dv/OnePose_Plus_Plus
๐Ÿ˜ŽDataset zjueducn-my.sharepoint.com/personal/12121064_zju_edu_cn/_layouts/15/onedrive.aspx
๐Ÿคฏ11๐Ÿ‘9โค1๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ„ BTS: Density Fields from Single View ๐Ÿ„

๐Ÿ‘‰Volumetric scene representation from a single image in challenging conditions

๐Ÿ˜ŽReview https://bit.ly/3wjHDvH
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.07668.pdf
๐Ÿ˜ŽProject fwmb.github.io/bts/
๐Ÿ”ฅ7๐Ÿ‘1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
โšกStyleGAN-T: unlocking Power of GANsโšก

๐Ÿ‘‰#Nvidia unveils StyleGAN-T to regain competitiveness to GANs vs. Diffusive Models

๐Ÿ˜ŽReview https://bit.ly/3HtKxEA
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.09515.pdf
๐Ÿ˜ŽProject sites.google.com/view/stylegan-t
๐Ÿ˜ŽCode github.com/autonomousvision/stylegan-t
๐Ÿ”ฅ9๐Ÿ‘4๐Ÿคฏ4โค1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿช€ NeRF in Time, Space and Appearance ๐Ÿช€

๐Ÿ‘‰From Berkeley k-planes: a white-box model for radiance fields in arbitrary dimensions

๐Ÿ˜ŽReview https://bit.ly/3J8GiiS
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.10241.pdf
๐Ÿ˜ŽProject sarafridov.github.io/K-Planes/
๐Ÿ˜ŽCode github.com/sarafridov/K-Planes
๐Ÿ‘2๐Ÿคฏ1๐Ÿพ1
Media is too big
VIEW IN TELEGRAM
๐Ÿ”ฅ Neural Tracking via Weighted OF ๐Ÿ”ฅ

๐Ÿ‘‰The new SOTA in planar neural tracking is INSANE!

๐Ÿ˜ŽReview https://bit.ly/404gcDs
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.10057.pdf
๐Ÿ˜ŽCode github.com/serycjon/WOFT
๐Ÿ˜ŽProject cmp.felk.cvut.cz/~serycjon/WOFT
๐Ÿคฏ15โค3๐Ÿ‘3๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
โ™ฟ Detecting Vulnerable Pedestrian โ™ฟ

๐Ÿ‘‰ BGSU opens a novel pedestrian dataset for vulnerable people

๐Ÿ˜ŽReview https://bit.ly/3JjVmu2
๐Ÿ˜ŽPaper arxiv.org/pdf/2212.06218.pdf
๐Ÿ˜ŽData github.com/devvansh1997/BGVP
๐Ÿ‘6โค1๐Ÿ”ฅ1
๐Ÿง  SERENA: LLM for Mental Health Support ๐Ÿง 

๐Ÿ‘‰Interactive #AI (in "#chatgpt" style) designed for mental health counseling

๐Ÿ˜ŽReview https://bit.ly/3wtbW37
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.09412.pdf
๐Ÿ˜ŽProject https://serena.chat/
๐Ÿ‘9โค2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ• MAV3D: #3D Video from Text ๐Ÿ•

๐Ÿ‘‰#META unveils a novel #AI for generating #3D dynamic videos from text

๐Ÿ˜ŽReview https://bit.ly/3XN0zin
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.11280.pdf
๐Ÿ˜ŽProject make-a-video3d.github.io
๐Ÿ”ฅ8๐Ÿ‘3๐Ÿคฃ3โค1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅCutLER: Unsupervised Segmentation ๐Ÿ”ฅ

๐Ÿ‘‰Novel paper by #META on detection & instance segmentation without human annotations

๐Ÿ˜ŽReview https://bit.ly/3DlFiUG
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.11320.pdf
๐Ÿ˜ŽCode github.com/facebookresearch/CutLER
๐Ÿ˜ŽProject people.eecs.berkeley.edu/~xdwang/projects/CutLER
โค10๐Ÿ‘4๐Ÿ”ฅ4๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ˜ CLIP/GPT3-driven Affective Faces ๐Ÿ˜

๐Ÿ‘‰Columbia unveils a neural framework for facial expressions retrieval given the context of the speaker

๐Ÿ˜ŽReview https://bit.ly/3HERna0
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.10939.pdf
๐Ÿ˜ŽProject realtalk.cs.columbia.edu
๐Ÿ˜ŽCode github.com/scottgeng00/realtalk
๐Ÿ”ฅ12โค5๐Ÿ‘1๐Ÿฅฐ1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ Physics-inspired Computer Vision ๐Ÿฆ

๐Ÿ‘‰UCLA unveils PhyCV, the first Physics-inspired Computer Vision Library

๐Ÿ˜ŽReview https://bit.ly/3HEWozI
๐Ÿ˜ŽCode github.com/JalaliLabUCLA/phycv
๐Ÿ˜ŽProject photonics.ucla.edu/2022/05/12/jalali-lab-open-sources-phycv-a-physics-inspired-computer-vision-library/
๐Ÿคฏ7โค5๐Ÿ‘4๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽทAudio-Visual Semantic Segmentation๐ŸŽท

๐Ÿ‘‰A novel problem in #AI: pixel-level segmentation of objects that produce sound in the image frame

๐Ÿ˜ŽReview https://bit.ly/3wFY6dw
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.13190.pdf
๐Ÿ˜ŽProject opennlplab.github.io/AVSBench
๐Ÿ˜ŽCode github.com/OpenNLPLab/AVSBench
๐Ÿคฏ10๐Ÿ‘3๐Ÿ”ฅ2โค1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿš› Text-driven Video Neural Editing ๐Ÿš›

๐Ÿ‘‰A novel text-guided video editing with both appearance/shape

๐Ÿ˜ŽReview https://bit.ly/3YcfMJO
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.13173.pdf
๐Ÿ˜ŽProject text-video-edit.github.io/
๐Ÿ”ฅ12๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
โญ Mono-STAR: Unified Track/3D โญ

๐Ÿ‘‰Real-time 3D unified framework for semantic fusion, tracking, non-rigid deformation, and topological changes

๐Ÿ˜ŽReview https://bit.ly/3Dxvxmx
๐Ÿ˜ŽPaper arxiv.org/pdf/2301.13244.pdf
๐Ÿ˜ŽProject github.com/changhaonan/Mono-STAR-demo
โšก5๐Ÿ‘4๐Ÿ”ฅ4โค1