Data Science by ODS.ai 🦜
46.1K subscribers
664 photos
77 videos
7 files
1.75K links
First Telegram Data Science channel. Covering all technical and popular staff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of former. To reach editors contact: @malev
Download Telegram
A Bag of Tricks for Image Classification

1. Large batch size
2. Mini model-tweaks
3. Refined Training Methods
4. Transfer Learning
5. Fancy Data Augmentation
https://link.medium.com/fzJvIBfsJS

#CV #tipsandtricks
Bayesian Optimization in AlphaGo

How latest AlphaGo agent win rate was improved from 50% to 66.5%.

ArXiV: https://arxiv.org/abs/1812.06855
​​Facebook have created and now open-sourced Nevergrad, a Python3 library that claims making easier to perform gradient-free optimizations.

Link: https://code.fb.com/ai-research/nevergrad/
Github: https://github.com/facebookresearch/nevergrad
Smart Compose: Using Neural Networks to Help Write Emails

Google shared some information about their new feature. Most important: they claim to focus on Fairness and Privacy, training on completely anonimized data and trying to eliminate biases.

Link: https://ai.googleblog.com/2018/05/smart-compose-using-neural-networks-to.html

#Google #SmartCompose #FairAI #Privacy
​​Reproducing high-quality singing voice
with state-of-the-art AI technology.

Some advance in singing voice synthesis. This opens path toward more interesting collaborations and sythetic celebrities projects.

P.S. Hatsune Miku's will still remain popular for their particular qualities, but now there is more room for competitors.

Link: https://www.techno-speech.com/news-20181214a-en

#SOTA #Voice #Synthesis
​​Overview of current state of autonomously driving vehicle by Ben Evans.

Not so technical overview of where first autonomous vehicles will become commodity.

Link: https://www.ben-evans.com/benedictevans/2018/3/26/steps-to-autonomy
​​Creating super slow motion videos by predicting missing frames using a neural network, instead of simple interpolation. With code.

Github: https://github.com/avinashpaliwal/Super-SloMo
Website: https://people.cs.umass.edu/~hzjiang/projects/superslomo/
​​Scaling Uber’s Apache Hadoop Distributed File System for Growth

Post on how #Uber team handles #Hadoop challenges.

https://eng.uber.com/scaling-hdfs/

#BigData #HDFS
​​Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond

New SOTA on cross-lingual transfer (XNLI, MLDoc) and bitext mining (BUCC) using a shared encoder for 93 languages.

Link: https://arxiv.org/abs/1812.10464

#SOTA #NLP
​​Building Automated Feature Rollouts on Robust Regression Analysis

Nice article on important thing β€” statistical analysis of hypothesis testing. Every new feature or change made to existent one is basically an experiment. Article covers how #Uber team handles this in live system.

Link: https://eng.uber.com/autonomous-rollouts-regression-analysis/

#Uber #statistics #production #truestory
A disciplined approach to neural network hyper-parameters

Recommendations on how to optimize learning rate, weight decay, momentum and batch size.

ArXiV: https://arxiv.org/pdf/1803.09820.pdf

#nn #hyperopt
Top 10 IPython Notebook Tutorials for Data Science and Machine Learning

List mostly for beginners.

Link: https://www.kdnuggets.com/2016/04/top-10-ipython-nb-tutorials.html

#novice #beginner #ipython #jupyter
Tomorrow (05 Jan) we are holding first offline meeting for this channel members and all the Data Scientists in Paris.

You are kindly welcome to come by Malongo cafΓ© at 10:00 to chat, share experience and have a coffee with fellow data scientist if you are in Paris these days.