DL in NLP links – Telegram

DL in NLP links

1.06K subscribers

5 photos

1 file

653 links

AI and DeepLearning news/articles links I use for @dlinnlp posts

DL in NLP links

https://github.com/jndean/LossRider

GitHub - jndean/LossRider: A plotting tool that outputs Line Rider maps, so you can watch a man on a sled scoot down your loss curves. 🎿

3.5K views · 22:49

DL in NLP links

https://arxiv.org/abs/2109.00137

Implicit Behavioral Cloning

We find that across a wide range of robot policy learning scenarios, treating supervised policy learning with an implicit model generally performs better, on average, than commonly used explicit...

3.73K views · 23:28

DL in NLP links

https://icrt.dev/

In-Context Imitation Learning via Next-Token Prediction

3.83K views · 04:33

DL in NLP links

https://x.com/spikedoanz/status/1831127711856935273?s=12&t=757tdnLa___vKX7ZeJax5A

3.99K views · 05:01

DL in NLP links

https://discuss.pytorch.org/t/distributed-w-torchtitan-introducing-async-tensor-parallelism-in-pytorch/209487

[Distributed w/ TorchTitan] Introducing Async Tensor Parallelism in PyTorch

with Horace He, Less Wright, Luca Wehrstedt, Tianyu Liu, Wanchao Liang TL;DR We implemented experimental async tensor parallelism support in PyTorch. We integrated it in TorchTitan and observed: Up to ~29% forward pass speedup and ~8% E2E speedup in Llama3…

🔥2

4.12K views · 03:55

DL in NLP links

https://arxiv.org/abs/2409.12917

Training Language Models to Self-Correct via Reinforcement Learning

Self-correction is a highly desirable capability of large language models (LLMs), yet it has consistently been found to be largely ineffective in modern LLMs. Current methods for training...

4.33K views · 04:29

DL in NLP links

https://arxiv.org/pdf/2405.08007

4.65K views · 17:30

DL in NLP links

Tldr: act dumb

4.46K views · 17:30

DL in NLP links

https://x.com/kellerjordan0/status/1842300916864844014?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

3.9K views · 15:38

DL in NLP links

https://x.com/stasbekman/status/1843483262129492200?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

4.05K views · 15:52

DL in NLP links

https://archive.is/2024.10.07-184310/https://www.theatlantic.com/technology/archive/2024/10/terence-tao-ai-interview/680153/

We’re Entering Uncharted Territory for Math - The Atlantic

archived 7 Oct 2024 18:43:10 UTC

❤1

4.8K views · 02:31

DL in NLP links

https://x.com/arankomatsuzaki/status/1844567821184872544?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

5.21K views · 04:21

DL in NLP links

https://x.com/pronounced_kyle/status/1845451573608186103

6.14K views · 18:36

DL in NLP links

https://arxiv.org/pdf/2406.14517

6.42K views · 15:45

DL in NLP links

https://x.com/yoavgo/status/1845835419264442772?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

👍1

7.11K views · 15:47

DL in NLP links

https://arxiv.org/abs/2410.05258

Differential Transformer

Transformer tends to overallocate attention to irrelevant context. In this work, we introduce Diff Transformer, which amplifies attention to the relevant context while canceling noise....

7.36K views · 03:24

DL in NLP links

https://x.com/lchoshen/status/1849060908242231329?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

7.09K views · 02:37

DL in NLP links

https://arxiv.org/abs/2410.01104

Softmax is not Enough (for Sharp Size Generalisation)

A key property of reasoning systems is the ability to make sharp decisions on their input data. For contemporary AI systems, a key carrier of sharp behaviour is the softmax function, with its...

👍1

7.9K views · 09:00

DL in NLP links

https://x.com/svlevine/status/1856924796996784244?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

7.71K views · 17:22

DL in NLP links

https://arxiv.org/abs/2412.01799

HPRM: High-Performance Robotic Middleware for Intelligent...

The rise of intelligent autonomous systems, especially in robotics and autonomous agents, has created a critical need for robust communication middleware that can ensure real-time processing of...

7.69K views · 17:41

DL in NLP links

https://x.com/thehumanoidhub/status/1868219800532771248?s=12&t=QgBLS4SmhE8cqdYBmhrqJA

X (formerly Twitter)

The Humanoid Hub (@TheHumanoidHub) on X

Meta Motivo, an open-source behavioral foundation model designed to control virtual, physics-based humanoid agents.

It aims to significantly simplify the creation of general-purpose humanoid agents for robotics and virtual avatars.

Try the demo: https:…

❤1 👍1

8.18K views · 15:59