Data Science by ODS.ai 🦜
46.1K subscribers
663 photos
77 videos
7 files
1.75K links
First Telegram Data Science channel. Covering all technical and popular staff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of former. To reach editors contact: @malev
Download Telegram
hey, we have 295 so far, and it is awesome, thank you!
It means that 15% of post viewers filled the questionnaire form of one question.

We kindly ask you to fill in if you haven’t yet, because we need 200 more responses to make this poll statistically significant!
πŸ’ͺπŸŽ†πŸŽ‡ πŸŽ‰we did it! 665 responses collected so far!
Thank you all. Results will be published as anonimized CSV (only countries and number of votes) later.
And, channel statistics, as promised. You can look at the numbers at: https://docs.google.com/spreadsheets/d/1_y6ojxU7svUAmPViqWveUf0yVLqoZyukJTGikcC_5hQ.

And again, thank you for your support!
A Bag of Tricks for Image Classification

1. Large batch size
2. Mini model-tweaks
3. Refined Training Methods
4. Transfer Learning
5. Fancy Data Augmentation
https://link.medium.com/fzJvIBfsJS

#CV #tipsandtricks
Bayesian Optimization in AlphaGo

How latest AlphaGo agent win rate was improved from 50% to 66.5%.

ArXiV: https://arxiv.org/abs/1812.06855
​​Facebook have created and now open-sourced Nevergrad, a Python3 library that claims making easier to perform gradient-free optimizations.

Link: https://code.fb.com/ai-research/nevergrad/
Github: https://github.com/facebookresearch/nevergrad
Smart Compose: Using Neural Networks to Help Write Emails

Google shared some information about their new feature. Most important: they claim to focus on Fairness and Privacy, training on completely anonimized data and trying to eliminate biases.

Link: https://ai.googleblog.com/2018/05/smart-compose-using-neural-networks-to.html

#Google #SmartCompose #FairAI #Privacy
​​Reproducing high-quality singing voice
with state-of-the-art AI technology.

Some advance in singing voice synthesis. This opens path toward more interesting collaborations and sythetic celebrities projects.

P.S. Hatsune Miku's will still remain popular for their particular qualities, but now there is more room for competitors.

Link: https://www.techno-speech.com/news-20181214a-en

#SOTA #Voice #Synthesis
​​Overview of current state of autonomously driving vehicle by Ben Evans.

Not so technical overview of where first autonomous vehicles will become commodity.

Link: https://www.ben-evans.com/benedictevans/2018/3/26/steps-to-autonomy
​​Creating super slow motion videos by predicting missing frames using a neural network, instead of simple interpolation. With code.

Github: https://github.com/avinashpaliwal/Super-SloMo
Website: https://people.cs.umass.edu/~hzjiang/projects/superslomo/
​​Scaling Uber’s Apache Hadoop Distributed File System for Growth

Post on how #Uber team handles #Hadoop challenges.

https://eng.uber.com/scaling-hdfs/

#BigData #HDFS
​​Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond

New SOTA on cross-lingual transfer (XNLI, MLDoc) and bitext mining (BUCC) using a shared encoder for 93 languages.

Link: https://arxiv.org/abs/1812.10464

#SOTA #NLP
​​Building Automated Feature Rollouts on Robust Regression Analysis

Nice article on important thing β€” statistical analysis of hypothesis testing. Every new feature or change made to existent one is basically an experiment. Article covers how #Uber team handles this in live system.

Link: https://eng.uber.com/autonomous-rollouts-regression-analysis/

#Uber #statistics #production #truestory
A disciplined approach to neural network hyper-parameters

Recommendations on how to optimize learning rate, weight decay, momentum and batch size.

ArXiV: https://arxiv.org/pdf/1803.09820.pdf

#nn #hyperopt