AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
136 photos
250 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ’Ĩ🚗 CrashCar101: Generative Damaged CarsđŸ’Ĩ🚗

👉 CrashCar101: procedural generation pipeline that damages 3D car models to obtain synthetic damaged cars paired with pixel-accurate annotations

👉 Review https://t.ly/pITHm
👉 Paper https://lnkd.in/dzp6q3T5
👉 Project https://lnkd.in/daRXg73N
❤7👍1đŸ”Ĩ1đŸ¤¯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐓 Emu: image edit / video gen. 🐓

👉#Meta the new SOTA in text-to-video generation and instruction-based image editing

👉 Review https://t.ly/PMTBc
👉 Paper (images): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eG8eWUJY
👉 Paper (video): https://lnkd.in/eVadH-QS
👉 Project https://lnkd.in/eu6Zu6gp
đŸ”Ĩ8đŸ¤¯2👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸŒĻī¸ 100+ GPU weather training đŸŒĻī¸

👉#NVIDIA just released Makani: massively parallel training of weather and climate prediction models on 100+ GPUs and to enable the development of the next generation of weather and climate models.

👉 Review https://t.ly/jageY
👉 Code https://lnkd.in/d4NFZ5xi
❤23đŸ¤¯7⚡1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸŋ Segmenting anything in 3D đŸŋ

👉 OmniSeg3D: omniversal segmentation method aims for segmenting anything in 3D all at once.

👉Review https://t.ly/Q0jrK
👉Paper https://lnkd.in/d9qpxXY9
👉Project https://oceanying.github.io/OmniSeg3D
👉Code (soon)
❤17đŸ”Ĩ7👍4đŸ¤¯2😱2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ”ŗ SOTA Semantic Boundary đŸ”ŗ

👉Mobile-Seed, a lightweight, dual-task framework tailored for simultaneous semantic segmentation and boundary detection.

👉Review https://t.ly/GsArZ
👉Project whu-usi3dv.github.io/Mobile-Seed/
👉Paper arxiv.org/pdf/2311.12651.pdf
👉Code github.com/WHU-USI3DV/Mobile-Seed
❤5👍1đŸ”Ĩ1đŸ¤¯1😱1
đŸ§ŋ SOTA Model-aware 3D Gaze đŸ§ŋ

👉 Novel hybrid approach that outputs 3D eye model, semantic segmentation, cam-intrinsic & pose. Only 2D eye semantic segmentation masks and fewer 3D gaze labels for supervision.

👉Review https://t.ly/AdKRf
👉Paper https://lnkd.in/dWb9GHPh
👉Code https://lnkd.in/dfAWFVky
đŸ”Ĩ11👍3đŸ¤¯3❤1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸĻ–T-Rex: Counting by Visual PromptingđŸĻ–

👉T-Rex: a novel interactive object counting model to detect and count any objects. Impressive results!

👉Review https://t.ly/4SfFX
👉Project https://lnkd.in/dVtEndHv
👉Paper https://lnkd.in/dBGQsbdP
👉Code (not announced, but an empty repo exists): https://lnkd.in/dnZnGRUn
👍16đŸ”Ĩ15❤4đŸ¤¯2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ”Ĩ Stable (Stability.AI) Video Diffusion đŸ”Ĩ

👉 #StabilityAI released Stable Video Diffusion: latent video diffusion model for high-resolution, SOTA text-to-video and image-to-video generation

👉 Review https://t.ly/XwHys
👉 Code https://lnkd.in/dQw_yNuV
👉 Paper https://lnkd.in/dHn6f787
đŸ”Ĩ17👍6đŸ¤¯3❤1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🎡 Panoptic Video Scene Graph 🎡

👉Combining video scene graph generation w/ panoptic segmentation for holistic video understanding. Novel HQ dataset with fine, temporal scene graph annotations & panoptic segmentation. Code released!đŸ”Ĩ

👉Review https://t.ly/tckDT
👉Project jingkang50.github.io/PVSG/
👉Paper arxiv.org/pdf/2311.17058.pdf
👉Code github.com/LilyDaytoy/OpenPVSG
👉Tool github.com/lilyDaytoy/PVSGAnnotation
đŸ”Ĩ7👍4❤3đŸ¤¯1
NebulOS.pdf
5.3 MB
đŸŒŗ NebulOS: (more than) Green AI đŸŒŗ

👉A novel hardware-aware Training-Free NAS approach that considers both training-free metrics & HW constraints, aiming to find the optimal balance between validation accuracy & energy consumption. 🚀

👉Review https://t.ly/Ozso1
👉Project sites.google.com/view/nebulos
👉Code github.com/fracapuano/NebulOS
👉Video https://lnkd.in/exN4Q2Fu
👉Hugging Face https://lnkd.in/eyCcPEPc
❤5🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 Material Palette from Images 🧱

👉A novel problem in #AI: material extraction from a real-world image without any prior knowledge đŸ¤¯

👉Discussion https://t.ly/AIWs-
👉Paper https://lnkd.in/dBFAVWPF
👉Project https://lnkd.in/dV5jK8Sm
👉Code https://lnkd.in/dNhMnfFb
👉Dataset (coming) ...
❤9👍2đŸ”Ĩ1đŸĨ°1đŸ¤¯1
This media is not supported in your browser
VIEW IN TELEGRAM
👑 HD Generative #AI With No $$👑

👉DemoFusion: a novel approach for HD image generation w/ no money. Progressive Upscaling, Skip Residual, & Dilated Sampling to achieve higher-resolution ever đŸ”Ĩ

👉Review https://t.ly/sIqDV
👉Paper https://lnkd.in/deDt-zcK
👉Project https://lnkd.in/dFGj47Xw
👉Code https://lnkd.in/dY3UcXwp
👍4❤2đŸ¤¯2👏1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍡 Animate Anyone: new SOTA! 🍡

👉Alibaba unveils Animate Anyone: novel #AI for transforming character images into animated videos controlled by desired pose sequences. Animating any character image into a video, unconstrained by specific domains 🚀

👉Review https://t.ly/qCahZ
👉Paper https://lnkd.in/d-zi8EZ6
👉Project https://lnkd.in/djwjQRvq
👉Repo https://lnkd.in/dDMkjnKz
đŸ¤¯22👍8đŸ”Ĩ4⚡1❤1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🔎 Generative Powers of Ten 🔍

👉A text-to-image model to generate consistent content across multiple image scales, enabling extreme semantic zooms into a scene. From universe to a human cell đŸ¤¯

👉Review https://t.ly/2DG44
👉Paper https://lnkd.in/eDcSpU59
👉Project https://lnkd.in/e6NKu8n9
đŸ¤¯21❤4đŸ”Ĩ3👏2😱1
Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!

👍 FREE TO FORWARD TO OTHER TELEGRAM CHANNELS

đŸ”Ĩ NO COPY OF THE POSTS
đŸ”Ĩ NO COMMERCIAL USAGE
đŸ”Ĩ NO UNRESPECTFUL USAGE

âš ī¸ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION âš ī¸
❤19👍10👏3đŸĨ°1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🩰 Magic Animating Human 🩰

👉MagicAnimate: the new SOTA in human animation. Code available: let's dance!

👉Review https://t.ly/Oq7Za
👉Paper https://lnkd.in/dSUbGgCs
👉Project https://lnkd.in/dkVFf-SV
👉Code https://lnkd.in/dj2dbzdg
👉Demo https://lnkd.in/dHEKPE9q
đŸ¤¯6❤2👍1đŸ”Ĩ1đŸĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸ”Ĩ EfficientSAM: 20x faster Segment Anything đŸ”Ĩ

👉Meta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!

👉Review https://t.ly/966QS
👉Paper https://lnkd.in/duijp_Rh
👉Project https://lnkd.in/dW-p2CuH
👉Code https://lnkd.in/dAbZaB2t
👉Demo https://lnkd.in/d-tjKiUd
đŸ”Ĩ15❤4👍4đŸ¤¯2
This media is not supported in your browser
VIEW IN TELEGRAM
đŸĢļ3D Hands with TransformersđŸĢļ

👉 HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.

👉Review https://t.ly/YtAW8
👉Paper https://arxiv.org/pdf/2312.05251.pdf
👉Project https://geopavlakos.github.io/hamer
👉Demo huggingface.co/spaces/geopavlakos/HaMeR
👉Colab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
👍10❤1👏1đŸ¤¯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
đŸĒŠ DreaMoving: Human Dancer đŸĒŠ

👉Alibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.

👉Review https://t.ly/BD_Yf
👉Paper https://lnkd.in/gepP6Rjw
👉Project https://lnkd.in/gwm72cfS
👉Repo (empty) https://lnkd.in/gsc2Qt-F
👍7💩6❤2đŸĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
📲 EdgeSAM: Mobile 40x SAM 📲

👉A novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available 😉

👉Review https://t.ly/m_vLH
👉Paper https://lnkd.in/gHZVZN2x
👉Project https://lnkd.in/gK8qEK8p
👉Repo https://lnkd.in/gj6YAGNv
👉Hugging Face https://lnkd.in/gUUHJvxz
đŸ”Ĩ20⚡2❤2🤩1