AI with Papers - Artificial Intelligence & Deep Learning
15.3K subscribers
136 photos
251 videos
14 files
1.32K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿš GTR: Global Tracking Transformers ๐Ÿš

๐Ÿ‘‰UTexas + Apple: transformer for global multi-object tracking

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…GTR operates on any object
โœ…Few frames->global trajectories
โœ…SOTA on detectors for any object
โœ…Code under Apache License 2.0

More: https://bit.ly/3DiqkxF
๐Ÿ”ฅ7๐Ÿ‘2๐Ÿคฏ2๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿง E2E Perception for #selfdrivingcars๐Ÿง 

๐Ÿ‘‰HybridNets: multi-task net with several key optimizations

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…End-to-end perception network
โœ…Traffic, lane, object detection
โœ…Drivable segmentation area
โœ…Real-time on embedded systems
โœ…Source code under MIT License

More: https://bit.ly/3JMk8Az
๐Ÿ‘8โค4๐Ÿ‘2๐Ÿคฏ1๐Ÿ˜ฑ1
Media is too big
VIEW IN TELEGRAM
๐Ÿ›ฉ๏ธSmart Parking with UAVs๐Ÿ›ฉ๏ธ

๐Ÿ‘‰A novel methodology to monitor car parking areas in real-time via Drones/UAVs

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…YoloV3 + DeepSort tracker
โœ…Vehicle detection/tracking
โœ…Occupancy estimation via RT
โœ…Four blocks, unique pipeline

More: https://bit.ly/3iJD8nm
โค8๐Ÿ‘5๐Ÿฅฐ1๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘• Detecting Events via #AI ๐Ÿ‘•

๐Ÿ‘‰Localizing object states & corresponding state-modifying actions

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…SS-learning state-modifying
โœ…Noise adaptive weighting
โœ…ChangeIt: 2.6k+ hrs , 34k+ changes
โœ…Dataset, code, and model!

More: https://bit.ly/3uBwxkj
๐Ÿ‘7๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŒˆ๐ŸŒˆ Interactive Neural Labelling ๐ŸŒˆ๐ŸŒˆ

๐Ÿ‘‰Dense labelling of geometry, color & semantics via #3D neural field

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…No training data
โœ…Dense labeling
โœ…Classes on the fly
โœ…Labelling at a scale

More: https://bit.ly/36Y0faQ
๐Ÿ”ฅ4๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
โ™Ÿ๏ธNeural RGB-D Reconstructionโ™Ÿ๏ธ

๐Ÿ‘‰Novel approach for #3D mixing implicit surface representations with NeRFs

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…RGB-D based reconstruction
โœ…Leveraging color & depth
โœ…Depth into the NeRF
โœ…Pose & camera refinement

More: https://bit.ly/3iN6e54
๐Ÿ”ฅ5๐Ÿ‘2๐Ÿคฏ2๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ“ Hyper-Fast Refinement ๐Ÿฆ“

๐Ÿ‘‰SharpContour: novel contour-based refinement for semantic segmentation

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Instance-aware Point Classifier
โœ…Deforming by discrete updating
โœ…Estimating offsets independently
โœ…Source code soon available!

More: https://bit.ly/3qL04GY
๐Ÿ‘5๐Ÿ”ฅ4๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅ— Neural Mesh via Text only ๐Ÿฅ—

๐Ÿ‘‰Zero-shot generation of 3D model using only a target text prompt

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…ZS 3D model with text only
โœ…ZS text-guided generation
โœ…Meshes with texture/normal
โœ…Differentiable LLS implementation

More: https://bit.ly/3u0qnvb
๐Ÿคฏ8๐Ÿ‘1๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿช†#3D, Materials, and Lighting from 2D๐Ÿช†

๐Ÿ‘‰Nvidia: topology, materials & map lighting jointly from 2D. INSANE ๐Ÿ˜ฎ

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Topology, materials and lighting
โœ…Meshes with materials/lighting
โœ…Compact volumetric texturing
โœ…Differentiable all-frequency lighting
โœ…Code under #NVIDIA License

More: https://bit.ly/3IUoF2t
๐Ÿ‘5๐Ÿ‘1๐Ÿคฏ1๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸœRef-NeRF for extreme realism๐Ÿœ

๐Ÿ‘‰Ref-NeRF: reflected radiance & structures via collection of spatially-varying scene properties

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Realism and accuracy
โœ…Replacing NeRFโ€™s params
โœ…Regularization of volume density
โœ…Integrated Directional Encoding

More: https://bit.ly/3tTlS5l
๐Ÿ‘4๐Ÿคฏ2๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฆงOFA for all: Cross, Vision, Language๐Ÿฆง

๐Ÿ‘‰Unified multimodal model for image generation, visual grounding, etc.

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Sequence-to-sequence learning
โœ…Image Captioning / Generation
โœ…Visual Grounding / Classification
โœ…Text-to-Image Generation
โœ…Visual Question Answering

More: https://bit.ly/3wSTGlc
๐Ÿ‘7๐Ÿคฏ6๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฟOld Films Back to Life with #AI๐Ÿฟ

๐Ÿ‘‰Recurrent transformer network (RTN) to restore heavily degraded old films

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Transformer blocks for spatial
โœ…Knowledge from adjacent frames
โœ…Color from keyframes to whole clip
โœ…Source code available in days!

More: https://bit.ly/3wZbV8y
โค12๐Ÿ‘2๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŠNeural Head #Avatars from RGB๐ŸŠ

๐Ÿ‘‰Novel neural representation for animatable head avatar

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Novel articulated human head
โœ…Full-geometry reconstruction
โœ…Differentiable optimization pipeline
โœ…Disentanglement of shape/color

More: https://bit.ly/3DxUGMI
๐Ÿ”ฅ3๐Ÿคฏ2๐Ÿ˜ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŒถ๏ธ MyStyle: personal generative #AI ๐ŸŒถ๏ธ

๐Ÿ‘‰Personalized deep generation with a few shots of a person

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Small set of portraits(โˆผ100)
โœ…Local, low-dim, personal manifold
โœ…Personal #AI for ill-posed tasks
โœ…SOTA vs. previous few-shots

More: https://bit.ly/3wWMwMu
๐Ÿ”ฅ5๐Ÿ‘4๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ† GAN + Dense Map ๐Ÿฆ†

๐Ÿ‘‰CoordGAN: structure-texture disentangled GAN with dense correspondence map

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Novel coordinate space
โœ…Warping to learn coordinate
โœ…Encoder for structure representation
โœ…HQ structure/texture editable images

More: https://bit.ly/3DOlOaB
๐Ÿคฏ4โค2๐Ÿ”ฅ2๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
โš“Unified shape & non-rigid motionโš“

๐Ÿ‘‰CaDeX: SOTA in both shape & non-rigid motion

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Canonical Deformation Coordinate Space
โœ…Shape + non rigid motion representation
โœ…Factorization of def-homeomorphisms
โœ…Cycle consistency, topology & volume
โœ…SOTA in modelling deformable objects

More: https://bit.ly/3NM5NX1
โค4๐Ÿคฏ1๐Ÿ˜ฑ1
๐Ÿ“ธ ~6 BILLION CLIP-filtered pairs ๐Ÿ“ธ

๐Ÿ‘‰A dataset 14x bigger than the previously biggest openly accessible image-text dataset in the world.

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…2,3B English image-text pairs
โœ…2,2B from 100+ other languages
โœ…1,3B language not detected
โœ…KNN index for quick search

More: https://bit.ly/3LFhKvT
โค3๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅฎ PP-YOLOE: e-version of YOLO ๐Ÿฅฎ

๐Ÿ‘‰ SOTA object detector up to 149+ FPS!

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Optimized PP-YOLOv2
โœ…S/M/L/XL for different scenarios
โœ…149+ FPS, with TensorRT & FP16
โœ…Source code & models available

More: https://bit.ly/3x454uy
๐Ÿ”ฅ5๐Ÿ‘3๐Ÿ‘1๐Ÿคฏ1๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿง™ HD synthesis with LDM ๐Ÿง™

๐Ÿ‘‰Low-cost DM via latent space of powerful pretrained autoencoders

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Hi-res synthesis of megapixel
โœ…Synthesis, inpainting, stochastic SR
โœ…Large, consistent images of โˆผ1024px
โœ…General conditioning via cross-attention
โœ…Code licensed under MIT License

More: https://bit.ly/3LIVOzS
๐Ÿ”ฅ6๐Ÿ‘3๐Ÿคฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŽฉ SinNeRF: Single Image NeRF ๐ŸŽฉ

๐Ÿ‘‰NEural Radiance Field via single view only

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…UATX + UIUC + UOregon + Picsart AI
โœ…"Looking only onceโ€ approach
โœ…semi-supervised learning process
โœ…Geometry/semantic pseudo-labels
โœ…SOTA in novel-view synthesis

More: https://bit.ly/3ujMZqF
๐Ÿ‘7๐Ÿ”ฅ2๐Ÿ‘1๐Ÿคฏ1