ArtificialIntelligenceArticles
2.97K subscribers
1.64K photos
9 videos
5 files
3.86K links
for who have a passion for -
1. #ArtificialIntelligence
2. Machine Learning
3. Deep Learning
4. #DataScience
5. #Neuroscience

6. #ResearchPapers

7. Related Courses and Ebooks
Download Telegram
MegatronLM: Training Billion+ Parameter Language Models Using GPU Model Parallelism
"... training an 8.3 billion parameter transformer language model with 8-way model parallelism and 64-way data parallelism on 512 GPUs, making it the largest transformer based language model ever trained at 24x the size of BERT and 5.6x the size of GPT-2."
Blog by NVIDIA Applied Deep Learning Research : https://nv-adlr.github.io/MegatronLM
Code: https://github.com/nvidia/megatron-lm
#ArtificialIntelligence #DeepLearning #NLP #PyTorch #Transformer
Generating Diverse High-Fidelity Images with VQ-VAE-2

Ali Razavi, Aaron van den Oord, Oriol Vinyals : https://arxiv.org/abs/1906.00446

#DeepLearning #VariationalAutoEncoder #VAE
Project Euphonia’s Personalized Speech Recognition for Non-Standard Speech
Blog by Joel Shor and Dotan Emanuel : https://ai.googleblog.com/2019/08/project-euphonias-personalized-speech.html
#ArtificialIntelligence #DeepLearning #NeuralNetworks
Is Deep Reinforcement Learning Really Superhuman on Atari?
Marin Toromanoff, Emilie Wirbel, Fabien Moutarde : https://arxiv.org/abs/1908.04683
#deeplearning #machinelearning #reinforcementlearning
Object as Distribution #NeurIPS2019


Propose bivariate normal distribution for object detection representation.
Benefits detection of highly-overlapping objects and downstream tracking

https://arxiv.org/abs/1907.12929
Forwarded from Lex Fridman
The following is our paper on driver functional vigilance during use of Tesla Autopilot driver assistance system. We analyzed 18,928 Autopilot disengagements. 3+ years of hard work with an incredible research team at MIT. Example videos out next week.

link: https://hcai.mit.edu/human-side-of-tesla-autopilot/
Forwarded from Lex Fridman
If a neural network generates an image, who owns the copyright? The owner of the dataset that the net was trained on? The designer of the network architecture? The person running the code? Or... the AI system itself? @lexfridman
Multiscale Representations for Manifold-Valued Data

Rahman et al.: https://statweb.stanford.edu/~symmlab/SymmPaper.pdf

#SymmetricSpace #Wavelets #Denoising
Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective

Tom Everitt and Marcus Hutter : https://arxiv.org/abs/1908.04734

#ArtificialIntelligence #MachineLearning #ReinforcementLearning
Why BLEU score sucks for evaluating translation systems.
(Or rather, why BLEU score works fine when you translation system sucks, but sucks when it's good).

https://arxiv.org/abs/1908.05204
Welcome SUPERGLUE from Facebook AI, DeepMind, University of Washington and New York University.
It comprises new ways to test creative approaches on a range of difficult NLP tasks and serves a series of benchmark tasks to measure the performance of modern, high performance language-understanding AI.

Made on the premise that deep learning models for conversational AI have “hit a ceiling” and need greater challenges .

Read https://arxiv.org/pdf/1905.00537.pdf