PetiteProgrammer

PetiteProgrammer / NamedEntityRecognition / 0.1.0

README.md

Overview

This algorithm extracts entities (persons, organizations, etc) out of English text.

Usage

Input

Text input: ""

Output

List of entities. Each entity found has the following format

ParameterDescription
label: StringType of entity, e.g. "PERSON", "ORG" etc.
start_char: IntStart position in text
end_char: IntEnd position in text
text: StringOriginal part of the text

Supported entities

The following entities are supported and can be expected in the label attribute.

TypeDescription
PERSONPeople, including fictional.
NORPNationalities or religious or political groups.
FACILITYBuildings, airports, highways, bridges, etc.
ORGCompanies, agencies, institutions, etc.
GPECountries, cities, states.
LOCNon-GPE locations, mountain ranges, bodies of water.
PRODUCTObjects, vehicles, foods, etc. (Not services.)
EVENTNamed hurricanes, battles, wars, sports events, etc.
WORK_OF_ARTTitles of books, songs, etc.
LAWNamed documents made into laws.
LANGUAGEAny named language.
DATEAbsolute or relative dates or periods.
TIMETimes smaller than a day.
PERCENTPercentage, including "%".
MONEYMonetary values, including unit.
QUANTITYMeasurements, as of weight or distance.
ORDINAL"first", "second", etc.
CARDINALNumerals that do not fall under another type.