AI with Papers - Artificial Intelligence & Deep Learning
15.2K subscribers
135 photos
247 videos
14 files
1.31K links
All the AI with papers. Every day fresh updates about #DeepLearning, #MachineLearning, LLMs and #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#artificialintelligence #machinelearning #ml #AI
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦•JoJoGAN: One Shot Face StylizationπŸ¦•

πŸ‘‰UIUC researchers unveil a novel method for one-shot image stylization.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Stylization from single input
βœ…Finetuning StyleGAN for stylization
βœ…No supervision, good generalization
βœ…MIT License (commercial allowed)

More: https://bit.ly/3ASVzyb
❀5πŸ‘2πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🧦SOTA in OOD detection for safer #AI🧦

πŸ‘‰Out-of-distribution (OOD) detection produces wrong/overconfident predictions.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel framework for OOD
βœ…Synthesizing virtual outliers
βœ…Novel unknown-aware training
βœ…Code and model available

More: https://bit.ly/3JnFIL9
πŸ”₯3πŸ‘2🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŒ…StyleGAN-XL neural synthesisπŸŒ…

πŸ‘‰From TΓΌbingen, StyleGAN-XL: new SOTA for large diverse dataset.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…First 1024p-gen for large data
βœ…Growing strategy on StyleGAN3
βœ…Beyond the narrow domains
βœ…Pivotal Tuning Inversion (TPI)
βœ…SOTA vs. GAN & diffusion models

More: https://bit.ly/3HK9MQk
πŸ”₯6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“ŒThis keypoint is pure GLUEπŸ“Œ

πŸ‘‰Keypoints play a central role in computer vision.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel Object-centric keypoint
βœ…Novel sim2real training method
βœ…Intra-salience / inter-distinctness
βœ…Enforcing semantic consistency
βœ…Close to fully-supervised method!

More: https://bit.ly/3rth1qh
πŸ”₯5πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’‘ LEDNet: seeing in the dark πŸ’‘

πŸ‘‰Researchers from NTU unveil LEDNet to see in the dark

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel data synthesis for low-light
βœ…Low-light/deblurring dataset
βœ…12k low-blur/normal-sharp pairs
βœ…LEDNet: lowlight + deblurring


More: https://bit.ly/3HIyYqM
πŸ‘6πŸ‘4πŸ”₯3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘©β€πŸ¦°Back in the 50's with GANπŸ‘©β€πŸ¦°

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…A few thousand vintage faces
βœ…Models available for download
βœ…Stylegan2-ffhqu-1024x1024
βœ…NO Commercial allowed

More: https://bit.ly/3LlOyKX
🀯2❀1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🦠VNCA: bio-inspired generative model 🦠

πŸ‘‰A novel generative model loosely inspired by the biological processes of cellular growth and differentiation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Variational Neural Cellular Automata
βœ…Probabilistic generative model
βœ…Learn from common vector format
βœ…Learn purely s.o. generative process
βœ…Far away from SOTA, but interesting

More: https://bit.ly/3oGb2wG
πŸ‘4πŸ”₯1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍊Block-NeRF: Neural View Synthesis🍊

πŸ‘‰Large-scale scene reconstruction by multiple compact NeRFs that each fit into memory.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Berkeley + Google + Waymo = 🀯
βœ…Scaling NeRF to city-scale scenes
βœ…Trick: multiple simple NeRFs
βœ…Time decoupled, arbitrarily large scene
βœ…Data over months & different conditions

More: https://bit.ly/3GGVHBV
πŸ‘4πŸ”₯3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¬HW-Accelerated Neuro-EvolutionπŸ₯¬

πŸ‘‰Scalable, general purpose, hardware accelerated neuro-evolution toolkit by Google

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Parallel on multiple TPU/GPUs
βœ…Neuro-evo algorithms with NNs
βœ…WaterWorld, Abstract paint, more
βœ…From Google, not an official product
βœ…Code under Apache License 2.0

More: https://bit.ly/3szEi9w
πŸ‘3πŸ”₯2🀯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸš› DeepETA: #Uber ETA via #AIπŸš›

πŸ‘‰Uber unveils the low-latency deep architecture for global ETA prediction

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Latency / Accuracy / Generality
βœ…7 NNs architectures tested
βœ…Encoder-decoder + Self-Attention
βœ…Linear transformer (kernel trick)
βœ…Feature sparsity for speed

More: https://bit.ly/3gFWmJh
πŸ‘3πŸ”₯1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
✏️CLIPasso: Semantic Sketching via CLIP✏️

πŸ‘‰Sketching method guided by geometric and semantic simplifications (CLIP)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…EPFL, TAU and IDC Herzliya
βœ…CLIP image encoder for sketching
βœ…Sketching as a set of Bezier curves
βœ…Param-optimization on CLIP-loss
βœ…Source code and models available

More: https://bit.ly/3oLEDF4
πŸ”₯2πŸ₯°2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ‚SAHI: slicing detection/segmentationπŸͺ‚

πŸ‘‰An open-source lightweight library for large scale object detection & instance segmentation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Slicing Aided Hyper Inference
βœ…Large-scale detection/segment.
βœ…Sliced inference and merging
βœ…Utils for conversion, slicing, etc.
βœ…Code licensed under MIT License

More: https://bit.ly/3uMJoBZ
πŸ”₯3❀2🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🎁100,000,000 image-text pairs!🎁

πŸ‘‰Large-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…100 Million <image, text> pairs
βœ…>200px size, aspect ratio (1/3~3)
βœ…Models of ResNet, ViT & SwinT
βœ…Methods: CLIP, FILIP and LiT
βœ…Privacy/Sensitive words πŸ€”

More: https://bit.ly/34BqlzX
πŸ‘5πŸ€”1
This media is not supported in your browser
VIEW IN TELEGRAM
🧁33 Million synthetic pedestrians🧁

πŸ‘‰A novel large, fully synthetic dataset

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Exploiting the #gta5 engine
βœ…764 full-HD videos @20 fps
βœ…33M+ person instances
βœ…BBs & segmentation masks
βœ…2D/3D keypoints & depth

More: https://bit.ly/36njlY1
πŸ‘6🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯Marker-free 6D-point trackingπŸ₯

πŸ‘‰Full position and rotation of skeletal joints, with only a RGB frame

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Full 3-axis joint rotations
βœ…V-markers, emulating mocap
βœ…#3D from monocular with NN
βœ…Generalization, no retraining
βœ…SOTA rotation/position est.

More: https://bit.ly/34GdoF5
πŸ”₯12🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🧼 Synthetic dataset for #Retail 🧼

πŸ‘‰A large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Dataset from Standard.AI
βœ…2,134 unique scenes
βœ…25k+ annotated samples
βœ…Introducing the "change detection"
βœ…Multi-view representation learning
βœ…NonCommercial-ShareAlike 4.0

More: https://bit.ly/3uXqubB
🀯6πŸ₯°3πŸ‘1πŸ”₯1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌈 Graph Neural Nets Forecasting🌈

πŸ‘‰Data-driven approach for forecasting global weather using graph neural networks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Data-driven forecasting via GNNs
βœ…Model: 6.7M parameters, float32
βœ…6-hours forecast in 0.04 secs.
βœ…A 5-day forecast in 0.8 secs.

More: https://bit.ly/3LH4CXR
πŸ‘4πŸ‘2πŸ€”1
Media is too big
VIEW IN TELEGRAM
πŸ₯«Watch Those Words!πŸ₯«

πŸ‘‰Berkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Regardless of falsification
βœ…Semantic person-specific
βœ…Word-conditioned analysis
βœ…Generalization across fakes

More: https://bit.ly/3oXWmcd
πŸ‘5😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”‹V2X-sim for #selfdriving is out!πŸ”‹

πŸ‘‰V2X: collaboration between a vehicle and any surrounding entity

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Suitable for #selfdrivingcars
βœ…Rec. from road & vehicles
βœ…Multi-streams/perception
βœ…Detection, tracking, & segmentation
βœ…RGB, depth, semantic, BEV & LiDAR

More: https://bit.ly/3H6veOI
πŸ”₯6🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍏Infinite Synthetic dataset for Fitness🍏

πŸ‘‰Opensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…60k images, 1-5 avatars
βœ…15 categories, 21 variations
βœ…Blender and ray-tracing
βœ…SMPL-X + facial expression
βœ…Cloth/skin tone sampled
βœ…147 4K HDRI panoramas
βœ…Creative Commons 4.0

More: https://bit.ly/33B1R9q
🀩5❀1πŸ‘1