nlp

nlp / SummarizeURL / 0.1.4

README.md

1. Introduction

This algorithm utilizes two algorithm: util/Html2Text and nlp/Summarizer. It retrieves the html content in a smart way by using heuristics with Html2Text algorithm, and summarizes the content using the Summarizer algorithm.

Input:

  • (Required): String URL.
  • (Optional): Number of sentences. (default=3)

Output:

  • Summarized page content.

Examples:

  • Parameter 1: URL

Example 1:

"http://techcrunch.com/2015/03/12/algorithmia-launches-with-more-than-800-algorithms-on-its-marketplace/"

Output:

"Algorithmia, the startup that raised $2.4 million last August to connect academics building powerful algorithms and the app developers who could put them to use, just brought its marketplace out of private beta. Algorithm developers can host their work on the site and charge a fee per-use to developers who integrate the algorithm into their own work. The platform encourages further additions to its library through a bounty system, which lets users request algorithms that researchers familiar with the field can contribute from their work or develop from scratch for a fee."

Example 2:

  • Parameter 1: URL
  • Parameter 2: Number of sentence(s)
[
    "http://techcrunch.com/2015/03/12/algorithmia-launches-with-more-than-800-algorithms-on-its-marketplace/",
    1
]

Output:

"Algorithmia, the startup that raised $2.4 million last August to connect academics building powerful algorithms and the app developers who could put them to use, just brought its marketplace out of private beta."