Skip to content
This repository has been archived by the owner on Mar 28, 2019. It is now read-only.

Service for extracting named entities from text fragments

License

Notifications You must be signed in to change notification settings

occrp-attic/extract-entities

Repository files navigation

extract-entities

This is a simple gRPC service to manage the extraction of named entities from text using one of several open source tools. The main reason for having this run in a service is to avoid inlining the required training data and Python dependencies in the main aleph application.

Usage

In order to use this container, you need to use the same protocol buffer specifications active on the server. The simplest way to use them would be via the servicelayer Python library, but the protocol can also be used independently.

Check out the following client code to see how extract-entities is used:

https://github.com/alephdata/servicelayer/blob/master/servicelayer/rpc/__init__.py

You can also inspect the protocol buffer files directly here:

https://github.com/alephdata/servicelayer/tree/master/protos/servicelayer/rpc

The container itself can be pulled from the DockerHub and will expose a service on port 50000:

docker pull alephdata/recognize-text
docker run -p 50000:50000 -ti alephdata/recognize-text

NER backend options

About

Service for extracting named entities from text fragments

Resources

License

Stars

Watchers

Forks

Packages

No packages published