Skip to content

Commit

Permalink
Add nlp tools (#437)
Browse files Browse the repository at this point in the history
* add PEFT

* add sentence-transformers

* add megatron-lm

* add llama.cpp

* add FastChat

* add Interactive Composition Explorer

* add LMFlow

* add MLC LLM

* add ESPnet

* add Lamini

* Update README.md

* Update README.md

* add StableLM

* update new name

* Ollama

* Azure ChatGPT

* h2oGPT

* remove non-existent repo

* add dspy
  • Loading branch information
zhimin-z committed Sep 3, 2023
1 parent 4e135d2 commit 772b0e2
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -479,6 +479,7 @@ This repository contains a curated list of awesome open source libraries that wi
* [Blackstone](https://github.com/ICLRandD/Blackstone) ![](https://img.shields.io/github/stars/ICLRandD/Blackstone.svg?style=social) - Blackstone is a spaCy model and library for processing long-form, unstructured legal text. Blackstone is an experimental research project from the Incorporated Council of Law Reporting for England and Wales' research lab, ICLR&D.
* [Coqui STT](https://github.com/coqui-ai/STT) ![](https://img.shields.io/github/stars/coqui-ai/STT.svg?style=social) - Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models.
* [CTRL](https://github.com/salesforce/ctrl) ![](https://img.shields.io/github/stars/salesforce/ctrl.svg?style=social) - A Conditional Transformer Language Model for Controllable Generation released by SalesForce.
* [dspy](https://github.com/stanfordnlp/dspy) ![](https://img.shields.io/github/stars/stanfordnlp/dspy.svg?style=social) - A framework for programming with foundation models.
* [Dust](https://github.com/dust-tt/dust) ![](https://img.shields.io/github/stars/dust-tt/dust.svg?style=social) - Dust assists in the design and deployment of large language model apps.
* [ESPnet](https://github.com/espnet/espnet) ![](https://img.shields.io/github/stars/espnet/espnet.svg?style=social) - ESPnet is an end-to-end speech processing toolkit.
* [Facebook's XLM](https://github.com/facebookresearch/XLM) ![](https://img.shields.io/github/stars/facebookresearch/XLM.svg?style=social) - PyTorch original implementation of Cross-lingual Language Model Pretraining which includes BERT, XLM, NMT, XNLI, PKM, etc..
Expand All @@ -489,6 +490,7 @@ This repository contains a curated list of awesome open source libraries that wi
* [Gretel Synthetics](https://github.com/gretelai/gretel-synthetics) ![](https://img.shields.io/github/stars/gretelai/gretel-synthetics.svg?style=social) - Gretel Synthetics is a synthetic data generators for structured and unstructured text, featuring differentially private learning.
* [Grover](https://github.com/rowanz/grover) ![](https://img.shields.io/github/stars/rowanz/grover.svg?style=social) - Grover is a model for Neural Fake News -- both generation and detection. However, it probably can also be used for other generation tasks.
* [Guardrails](https://github.com/ShreyaR/guardrails) ![](https://img.shields.io/github/stars/ShreyaR/guardrails.svg?style=social) - Guardrails is a package that lets a user add structure, type and quality guarantees to the outputs of large language models.
* [h2oGPT](https://github.com/h2oai/h2ogpt) ![](https://img.shields.io/github/stars/h2oai/h2ogpt.svg?style=social) - h2oGPT is an open source generative AI, gives organizations like yours the power to own large language models while preserving your data ownership.
* [Haystack](https://github.com/deepset-ai/haystack) ![](https://img.shields.io/github/stars/deepset-ai/haystack.svg?style=social) - Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-3 and alike). Haystack offers production-ready tools to quickly build ChatGPT-like question answering, semantic search, text generation, and more.
* [Interactive Composition Explorer](https://github.com/oughtinc/ice) ![](https://img.shields.io/github/stars/oughtinc/ice.svg?style=social) - ICE is a Python library and trace visualizer for language model programs.
* [Kashgari](https://github.com/BrikerMan/Kashgari) ![](https://img.shields.io/github/stars/BrikerMan/Kashgari.svg?style=social) - Kashgari is a simple and powerful NLP Transfer learning framework, build a state-of-art model in 5 minutes for named entity recognition (NER), part-of-speech tagging (PoS), and text classification tasks.
Expand All @@ -499,6 +501,7 @@ This repository contains a curated list of awesome open source libraries that wi
* [LMFlow](https://github.com/OptimalScale/LMFlow) ![](https://img.shields.io/github/stars/OptimalScale/LMFlow.svg?style=social) - LMFlow is an extensible, convenient, and efficient toolbox for finetuning large machine learning models.
* [Megatron-LM](https://github.com/NVIDIA/Megatron-LM) ![](https://img.shields.io/github/stars/NVIDIA/Megatron-LM.svg?style=social) - Megatron-LM is a highly optimized and efficient library for training large language models.
* [MLC LLM](https://github.com/mlc-ai/mlc-llm) ![](https://img.shields.io/github/stars/mlc-ai/mlc-llm.svg?style=social) - MLC LLM is a universal solution that allows any language models to be deployed natively on a diverse set of hardware backends and native applications, plus a productive framework for everyone to further optimize model performance for their own use cases.
* [Ollama](https://github.com/jmorganca/ollama) ![](https://img.shields.io/github/stars/jmorganca/ollama.svg?style=social) - Get up and running with large language models, locally.
* [sense2vec](https://github.com/explosion/sense2vec) ![](https://img.shields.io/github/stars/explosion/sense2vec.svg?style=social) - A Pytorch library that allows for training and using sense2vec models, which are models that leverage the same approach than word2vec, but also leverage part-of-speech attributes for each token, which allows it to be "meaning-aware".
* [Sentence Transformers](https://github.com/UKPLab/sentence-transformers) ![](https://img.shields.io/github/stars/UKPLab/sentence-transformers.svg?style=social) - Sentence Transformers provides an easy method to compute dense vector representations for sentences, paragraphs, and images.
* [SpaCy](https://github.com/explosion/spaCy) ![](https://img.shields.io/github/stars/explosion/spaCy.svg?style=social) - Industrial-strength natural language processing library built with python and cython by the explosion.ai team.
Expand Down

0 comments on commit 772b0e2

Please sign in to comment.