Machine Learning Projects
Official PyTorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

📝
https://github.com/ybybzhang/controlvideo
Large Language Models as Tool Makers

📝Recent research shows the potential of enhancing the problem-solving ability of large language models (LLMs) through the use of external tools. However, prior work along this line depends on the availability of existing tools. In this work, we take an initial step towards removing this dependency by proposing a closed-loop framework, referred to as LLMs As Tool Makers (LATM), where LLMs create their own reusable tools for problem-solving.
https://github.com/ctlllll/llm-toolmaker
NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

📝We present a neural rendering-based method called NeRO for reconstructing the geometry and the BRDF of reflective objects from multiview images captured in an unknown environment.
https://github.com/liuyuan-pal/NeRO
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

📖Large language models (LLMs) have shown excellent performance on various tasks, but the astronomical model size raises the hardware barrier for serving (memory size) and slows down token generation (memory bandwidth).

https://github.com/mit-han-lab/llm-awq
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

📖Modern hierarchical vision transformers have added several vision-specific components in the pursuit of supervised classification performance.

https://github.com/facebookresearch/hiera