Skip to content

Google Cloud AutoML Natural Language for Toxicity Classification

License

Notifications You must be signed in to change notification settings

dvdbisong/automl-toxicity-classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Google Cloud AutoML Natural Language for Text Classification

Building a Language Toxicity Classification Model

Title Google Cloud AutoML Natural Language for Text Classification
Author Ekaba Bisong
Google Developer Expert in Machine Learning
Google Certified Professional Data Engineer
Website #

jigsaw automl nlp

Google Cloud AutoML for Natural Language provides the platform for designing and developing custom language models for language recognition use-cases. This project uses Google Cloud AutoML for Natural Language to develop an end-to-end language toxicity classification model to identify obscene text. The concept of neural architecture search and transfer learning are used under the hood to find the best network architecture and the optimal hyperparameter setting that improves the performance of the model.

About the Dataset

The data used in this project is from the Toxic Comment Classification Challenge on Kaggle by Jigsaw and Google. The data is modified to have a sample of 16,000 toxic and 16,000 non-toxic words as inputs to build the model on AutoML NLP.

The dataset is hosted on Kaggle and can be accessed at Toxic Comment Classification Challenge.

About

Google Cloud AutoML Natural Language for Toxicity Classification

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published