TriplesExtraction

This repository implements two approaches for extracting knowledge triples from text. The first is based on dependency parsing; the second uses BERT token classification to tag the triples in the text. Currently, only the 20 Newsgroups dataset is supported.

Features

Downloads and cleans the data.
Extracts triples.
Saves the extracted triples to data/20NewsGroups.csv.
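As a rough illustration of consuming the output, here is a minimal sketch that reads a 'triples' column from a CSV. It assumes the file has a header row; the helper name `read_triples_column`, the `text` column, and the sample rows are made up for the example, and the way each triple is serialized in the real file may differ.

```python
import csv
import io


def read_triples_column(handle, column="triples"):
    """Return the raw value of `column` for every row of an open CSV file."""
    reader = csv.DictReader(handle)
    if reader.fieldnames is None or column not in reader.fieldnames:
        raise KeyError(f"column {column!r} not found in CSV header")
    return [row[column] for row in reader]


# In-memory stand-in for data/20NewsGroups.csv (illustrative rows only).
sample = io.StringIO(
    "text,triples\n"
    '"The cat chased the mouse.","(cat, chased, mouse)"\n'
    '"NASA launched a probe.","(NASA, launched, probe)"\n'
)
triples = read_triples_column(sample)
print(triples)
```

To read the real file, the same function can be given `open("data/20NewsGroups.csv", newline="", encoding="utf-8")` instead of the in-memory sample.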

Running

To extract triples via dependency parsing, use the 'dep' mode on the command line.

python3 extract.py --mode dep

The script extracts the triples and saves them in the 'triples' column of data/20NewsGroups.csv.
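The general idea behind dependency-based extraction can be sketched as follows. This is not the logic in extract.py; it is a simplified subject-verb-object pass over a hand-written parse, so it runs without a parser installed. The `Token` class, the `extract_svo` function, and the label sets are assumptions chosen for illustration (the labels follow common dependency-label conventions).

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    dep: str   # dependency label, e.g. "nsubj", "dobj", "ROOT"
    head: int  # index of the head token (the root points to itself)


SUBJECT_DEPS = {"nsubj", "nsubjpass"}
OBJECT_DEPS = {"dobj", "obj"}


def extract_svo(tokens):
    """Collect (subject, verb, object) triples from one parsed sentence."""
    triples = []
    for head_idx, head in enumerate(tokens):
        # Direct dependents of this token, excluding the root's self-loop.
        children = [
            t for i, t in enumerate(tokens) if t.head == head_idx and i != head_idx
        ]
        subjects = [t.text for t in children if t.dep in SUBJECT_DEPS]
        objects = [t.text for t in children if t.dep in OBJECT_DEPS]
        # A token yields triples only if it governs both a subject and an object.
        triples.extend((s, head.text, o) for s in subjects for o in objects)
    return triples


# Hand-written parse of "The cat chased the mouse" (illustrative only).
sentence = [
    Token("The", "det", 1),
    Token("cat", "nsubj", 2),
    Token("chased", "ROOT", 2),
    Token("the", "det", 4),
    Token("mouse", "dobj", 2),
]
svo = extract_svo(sentence)
print(svo)
```

A real pipeline would obtain the labels and head indices from a dependency parser and would usually expand each subject and object to its full noun phrase.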

To extract triples by running inference with the transformer-based model, use the 'bert' mode on the command line.

Before running it, make sure of the following:

The checkpoint has been downloaded from the Google Drive link (provided in the email), and config.py points to the checkpoint's location.

python3 extract.py --mode bert
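Token classification assigns one label to each token, so the predictions still have to be grouped into triples. The sketch below shows one way to do that, assuming a BIO tagging scheme with the hypothetical labels SUB, PRED, and OBJ; the label set used by the repository's checkpoint is not stated here, and the function names are made up for the example.

```python
def decode_spans(tokens, tags):
    """Group BIO-tagged tokens into (role, text) spans, in order of appearance."""
    spans = []
    role, words = None, []
    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        if prefix == "I" and label == role:
            words.append(token)  # continue the currently open span
            continue
        if role is not None:
            spans.append((role, " ".join(words)))  # close the open span
        if prefix in ("B", "I") and label:
            role, words = label, [token]  # open a new span (a stray I- tag also opens one)
        else:
            role, words = None, []  # "O" tag: outside any span
    if role is not None:
        spans.append((role, " ".join(words)))
    return spans


def spans_to_triple(spans):
    """Use the first span of each role; return None if any role is missing."""
    first = {}
    for role, text in spans:
        first.setdefault(role, text)
    if all(r in first for r in ("SUB", "PRED", "OBJ")):
        return (first["SUB"], first["PRED"], first["OBJ"])
    return None


# Illustrative tokens and predicted tags (not real model output).
tokens = ["Marie", "Curie", "discovered", "polonium", "."]
tags = ["B-SUB", "I-SUB", "B-PRED", "B-OBJ", "O"]
triple = spans_to_triple(decode_spans(tokens, tags))
print(triple)
```

With a subword tokenizer, the per-piece predictions would first need to be mapped back to whole words before this grouping step.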

Note

GPU-based training can be done using the Colab_training_Triples.ipynb notebook. GPU-based k-fold inference can be done using the Colab_Inference_Triples.ipynb notebook.

To use these notebooks, point the data and checkpoint paths in the Config->Globals section to the required files.
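One common way to combine models trained on k folds at inference time is to average their per-token class probabilities and pick the highest-scoring class. Whether the inference notebook combines folds exactly this way is an assumption; the function names and the numbers below are illustrative only.

```python
def average_fold_probs(fold_probs):
    """Average probabilities over folds.

    fold_probs[k][t][c] is the probability from fold k for token t and class c.
    """
    n_folds = len(fold_probs)
    n_tokens = len(fold_probs[0])
    n_classes = len(fold_probs[0][0])
    return [
        [sum(fold[t][c] for fold in fold_probs) / n_folds for c in range(n_classes)]
        for t in range(n_tokens)
    ]


def predict_labels(fold_probs):
    """Return the index of the highest averaged probability for each token."""
    averaged = average_fold_probs(fold_probs)
    return [max(range(len(row)), key=row.__getitem__) for row in averaged]


# Made-up probabilities: 3 folds, 2 tokens, 2 classes.
fold_probs = [
    [[0.7, 0.3], [0.4, 0.6]],  # fold 0
    [[0.6, 0.4], [0.2, 0.8]],  # fold 1
    [[0.4, 0.6], [0.3, 0.7]],  # fold 2
]
labels = predict_labels(fold_probs)
print(labels)
```

Note that fold 2 disagrees with the other folds on the first token; averaging lets the majority evidence decide.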

Thank you.
