Developer Center

Resources to get you started with Algorithmia

Classify Documents To Topics

Updated

Available on GitHub.

Finding topics in unstructured text data is a common use case that is easily solved using LDA, a topic modeling algorithm hosted on Algorithmia.

This recipe covers how to use LDA for topic extraction on text documents and then classify new documents to those topics.

For the full blog post related to this recipe, see Use LDA to Classify Text Documents.

Getting Started

Install the Algorithmia client from PyPi:

  install algorithmia 

You’ll also need a free Algorithmia account, which includes 5,000 free credits a month.

Sign up here, and then grab your API key.

Find this line in the script:

 
client = Algorithmia.client("YOUR_API_KEY")
 

and add in your API key.

How to Classify Documents with LDA

After putting in your own API key to the line above run it in your console environment:

  lda.py 

Built With