AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
🥎SportsMOT + MixSort = Sports MOT🥎

👉Nanjing University just released SportsMOT, a multi-object tracking (MOT) dataset for sports scenes, plus the SOTA tracking code/model (MixSort)

😎Review https://t.ly/NHUxL
😎Paper arxiv.org/pdf/2304.05170.pdf
😎Code github.com/MCG-NJU/MixSort
😎Project deeperaction.github.io/datasets/sportsmot.html

⚡️Feature Matching at Light Speed⚡️

👉LightGlue is a lightweight feature matcher with high accuracy and blazing fast inference

😎Review https://t.ly/jkecX
😎Paper arxiv.org/pdf/2306.13643.pdf
😎Code github.com/cvg/LightGlue

🕹️ CoDeF: Video Content Deformation Fields 🕹️

👉CoDeF is a new type of video representation for video-editing tasks

😎Review https://t.ly/PIVl-
😎Paper arxiv.org/pdf/2308.07926.pdf
😎Project https://qiuyu96.github.io/CoDeF
😎Code https://github.com/qiuyu96/CoDeF

Hello everybody,
many of you asked me to open the comments to better enjoy the posts. I'm following your suggestion and hope you will enjoy this new format!

🔥 NO SPAM
🔥 NO COMMERCIAL
🔥 NO DISRESPECTFUL MESSAGES

🧡JUST AI & SCIENCE

⚠️ BAN AT THE FIRST VIOLATION ⚠️

🦠 Instance-Level Semantics of Cells 🦠

👉TYC: a novel dataset for understanding instance-level semantics & motions of cells in microstructures

😎Review https://t.ly/y-4VZ
😎Paper arxiv.org/pdf/2308.12116.pdf
😎Project christophreich1996.github.io/tyc_dataset/
😎Code github.com/ChristophReich1996/TYC-Dataset
😎Data tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/3930

🌵POCO: 3D HPS + Confidence🌵

👉A novel framework for human pose and shape (HPS) estimation: #3D human body + confidence in a single feed-forward pass

😎Review https://t.ly/cDePe
😎Paper arxiv.org/pdf/2308.12965.pdf
😎Project https://poco.is.tue.mpg.de

🌆 NeO360: NeRF for Sparse Outdoor 🌆

👉#Toyota (+ Georgia Tech) unveils NeO360: 360° view synthesis of outdoor scenes from one or a few posed RGB images

😎Review https://t.ly/JDJZg
😎Paper arxiv.org/pdf/2308.12967.pdf
😎Project zubair-irshad.github.io/projects/neo360.html

🥕 Scenimefy: I2I for Anime 🥕

👉S-Lab unveils a novel semi-supervised image-to-image (I2I) translation framework + HD dataset for anime

😎Review https://t.ly/IsdEG
😎Paper arxiv.org/pdf/2308.12968.pdf
😎Code https://github.com/Yuxinn-J/Scenimefy
😎Project https://yuxinn-j.github.io/projects/Scenimefy.html

🐨 Watch Your Steps: Editing by Text 🐨

👉A new SOTA in text-driven image & scene editing via denoising diffusion models

😎Review https://t.ly/fv9wn
😎Paper arxiv.org/pdf/2308.08947.pdf
😎Project ashmrz.github.io/WatchYourSteps

💡 Relighting NeRF 💡

👉Neural implicit radiance representation for free-viewpoint relighting of an object lit by a moving point light

😎Review https://t.ly/J-3_L
😎Project nrhints.github.io
😎Code github.com/iamNCJ/NRHints
😎Paper nrhints.github.io/pdfs/nrhints-sig23.pdf

🪶 ReST: Multi-Camera MOT 🪶

👉Novel reconfigurable two-step graph model for multi-camera multi-object video tracking (MC-MOT)

😎Review https://t.ly/3C5tb
😎Paper arxiv.org/pdf/2308.13229.pdf
😎Code github.com/chengche6230/ReST

🌲MagicEdit: Magic Video Edit🌲

👉MagicEdit explicitly disentangles content, structure & motion for Hi-Fi and temporally coherent video editing

😎Review https://t.ly/tREX4
😎Paper arxiv.org/pdf/2308.14749.pdf
😎Project magic-edit.github.io
😎Code github.com/magic-research/magic-edit

✂️ VideoCutLER: Simple UVIS ✂️

👉VideoCutLER is a simple unsupervised video instance segmentation (UVIS) method that does not rely on optical flow

😎Review https://t.ly/PBBjG
😎Paper arxiv.org/pdf/2308.14710.pdf
😎Project people.eecs.berkeley.edu/~xdwang/projects/CutLER
😎Code github.com/facebookresearch/CutLER/tree/main/videocutler

🐦 3D Pigeons Pose & Tracking 🐦

👉3D-MuPPET estimates and tracks 3D poses of pigeons from multiple views

😎Review https://t.ly/jfAJJ
😎Paper arxiv.org/pdf/2308.15316.pdf
😎Code github.com/alexhang212/3D-MuPPET/

🎍RoboTAP: Dense Tracking for Few-Shot Imitation🎍

👉RoboTAP: a novel dense-tracking representation for robotic arms

😎Review https://t.ly/MCO_V
😎Paper arxiv.org/pdf/2308.15975.pdf
😎Project https://robotap.github.io/
😎Code github.com/deepmind/tapnet

⛺FACET: Fairness in Computer Vision⛺

👉#META AI opens a large, publicly available dataset for classification, detection & segmentation, designed to surface potential performance disparities & challenges across sensitive demographic attributes

😎Review https://t.ly/mKn-t
😎Paper arxiv.org/pdf/2309.00035.pdf
😎Dataset https://facet.iss.onetademolab.com/

♊️ Doppelgangers in Structures ♊️

👉A novel learning-based approach for visual disambiguation: distinguishing illusory matches to produce correct, disambiguated #3D reconstructions

😎Review https://t.ly/9yLot
😎Paper arxiv.org/pdf/2309.02420.pdf
😎Code github.com/RuojinCai/Doppelgangers
😎Project doppelgangers-3d.github.io/

🍃 Tracking Anything with Decoupled VOS 🍃

👉A novel VOS approach that extends SAM for open-world video segmentation with no user input required

😎Review https://t.ly/xeobR
😎Paper arxiv.org/pdf/2309.03903.pdf
😎Project hkchengrex.com/Tracking-Anything-with-DEVA
😎Code github.com/hkchengrex/Tracking-Anything-with-DEVA
😎Colab https://colab.research.google.com/drive/1OsyNVoV_7ETD1zIE8UWxL3NXxu12m_YZ

🪷 Diffusive Consistent Video Editing 🪷

👉Weizmann Institute of Science unveils TokenFlow, a novel framework that harnesses a text-to-image diffusion model for text-driven video editing

😎Review https://t.ly/ru8km
😎Paper arxiv.org/pdf/2307.10373.pdf
😎Project diffusion-tokenflow.github.io
😎Code github.com/omerbt/TokenFlow