chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
Language: Python
#artificial_intelligence #data_science #language_model #natural_language_processing #nlp #open #python #text_mining
Stars: 371 Issues: 2 Forks: 31
https://github.com/chiphuyen/lazynlp
  
  pingpong-ai/xlnet-pytorch
2019 Google Brain's XLNet PyTorch Implementation
Language: Python
#language_model #pytorch #transformer_xl #xlnet #xlnet_pytorch
Stars: 110 Issues: 2 Forks: 9
https://github.com/pingpong-ai/xlnet-pytorch
SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT)
Language: Python
#korean_nlp #language_model
Stars: 132 Issues: 0 Forks: 17
https://github.com/SKTBrain/KoBERT
  
  maraoz/gpt-scrolls
A collaborative collection of open-source safe GPT-3 prompts that work well
#generator #gpt_3 #language_model #openai #safety #transformer
Stars: 123 Issues: 4 Forks: 7
https://github.com/maraoz/gpt-scrolls
  
  nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Language: Python
#adapters #artificial_intelligence #deeplearning #dependency_parsing #language_model #lemmatization #machine_learning #morphological_tagging #multilingual #natural_language_processing #nlp #part_of_speech_tagging #pytorch #sentence_segmentation #tokenization #universal_dependencies #xlm_roberta
Stars: 120 Issues: 0 Forks: 8
https://github.com/nlp-uoregon/trankit
  
  will-thompson-k/tldr-transformers
The "tl;dr" on a few notable transformer papers.
#nlp #deep_learning #notes #transformers #attention #transfer_learning #language_models #language_model #bert #open_ai #huggingface #huggingface_transformer #gpt_3
Stars: 90 Issues: 2 Forks: 2
https://github.com/will-thompson-k/tldr-transformers
  
  DeutscheKI/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Language: C
#asr #c_plus_plus #command_line_tool #debian #debian_package #deep_learning #german_speech_recognition #kenlm #language_model #machine_learning #no_cloud #offline #pretrained_models #private #speech #speech_recognition #speech_to_text #tensorflow #tensorflow_lite #wave
Stars: 250 Issues: 0 Forks: 9
https://github.com/DeutscheKI/tevr-asr-tool
  
  extreme-bert/extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
Language: Python
#bert #deep_learning #language_model #language_models #machine_learning #natural_language_processing #nlp #python #pytorch #transformer
Stars: 135 Issues: 0 Forks: 5
https://github.com/extreme-bert/extreme-bert
  
  JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
Language: Python
#english_language #language_model #machine_learning
Stars: 127 Issues: 1 Forks: 11
https://github.com/JonasGeiping/cramming
  
  BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and open source.
Language: Python
#chatbot #chatgpt #language_model #pytorch #rnn #rwkv
Stars: 293 Issues: 0 Forks: 13
https://github.com/BlinkDL/ChatRWKV
  
  NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
  
  tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python
#deep_learning #instruction_following #language_model
Stars: 2633 Issues: 3 Forks: 169
https://github.com/tatsu-lab/stanford_alpaca
  
  context-labs/autodoc
Experimental toolkit for auto-generating codebase documentation using LLMs
Language: TypeScript
#cli_tool #documentation_generator #language_model #typescript
Stars: 568 Issues: 7 Forks: 18
https://github.com/context-labs/autodoc
  
  mlc-ai/web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Language: Python
#chatgpt #deep_learning #language_model #llm #tvm #webgpu #webml
Stars: 1009 Issues: 1 Forks: 41
https://github.com/mlc-ai/web-llm
  
  xtekky/chatgpt-clone
ChatGPT interface with better UI, running on free GPT APIs
Language: JavaScript
#chatgpt #chatgpt_api #chatgpt_app #chatgpt_clone #gpt_4 #gpt_4_api #gpt_interface #gpt3 #gpt4 #gpt4_api #gpt4all #interface #language #language_model #site #ui
Stars: 287 Issues: 4 Forks: 70
https://github.com/xtekky/chatgpt-clone
  
  mlc-ai/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Language: Python
#language_model #llm #machine_learning_compilation #tvm
Stars: 319 Issues: 5 Forks: 15
https://github.com/mlc-ai/mlc-llm
  
  salesforce/xgen
Salesforce open-source LLMs with 8k sequence length.
Language: Python
#language_model #large_language_models #llm #nlp
Stars: 357 Issues: 6 Forks: 18
https://github.com/salesforce/xgen
  
  