Html 2 Text

No algorithm description given

Table of Contents Introduction Examples Credits Introduction Takes in a url and extracts the content from the page.  Makes an attempt to remove non-content text like navigation and footer text. Input: (Required): Website URL. Output: Extracted text from website URL. Examples Example 1. Parameter 1: Wikipedia article URL. "https://en.wikipedia.org/wiki/Aziz_Sancar" Output: Aziz Sancar (born 8 September 1946) is a Turkish-American biochemist and molecular biologist specializing in DNA repair, cell cycle checkpoints, and circadian clock.[4] In 2015, he was awarded the Nobel Prize in Chemistry ... [24] Sancar is the second Turkish Nobel laureate after Orhan Pamuk, who is also an alumnus of Istanbul University. Example 2. Parameter 1: Techcrunch article URL. "http://techcrunch.com/2015/03/12/algorithmia-launches-with-more-than-800-algorithms-on-its-marketplace/" Output: "Algorithmia, the startup that raised $2.4 million last August to connect academics building powerful algorithms and the app developers who could put them to use, just brought its marketplace out of private beta. More than 800 algorithms are available on the marketplace, providing the smarts needed to do various tasks in the fields of machine learning, audio and visual processing, and even computer vision. Algorithm developers can host their work on the site and charge a fee per-use to developers who integrate the algorithm into their own work. The platform encourages further additions to its library through a bounty system, which lets users request algorithms that researchers familiar with the field can contribute from their work or develop from scratch for a fee. To demonstrate the platform’s algorithm hosting tools, the Algorithmia team built a simple app using seven user-contributed algorithms that visualizes what a crawler does as it works through links to build the structure of a site." Credits JSOUP was used to scrape content from HTML in this algorithm.

Tags
(no tags)

Cost Breakdown

0 cr
royalty per call
1 cr
usage per second
avg duration
This algorithm has permission to call other algorithms which may incur separate royalty and usage costs.

Cost Calculator

API call duration (sec)
×
API calls
=
Estimated cost
per calls
for large volume discounts
For additional details on how pricing works, see Algorithmia pricing.

Internet access

This algorithm has Internet access. This is necessary for algorithms that rely on external services, however it also implies that this algorithm is able to send your input data outside of the Algorithmia platform.


Calls other algorithms

This algorithm has permission to call other algorithms. This allows an algorithm to compose sophisticated functionality using other algorithms as building blocks, however it also carries the potential of incurring additional royalty and usage costs from any algorithm that it calls.


To understand more about how algorithm permissions work, see the permissions documentation.

1. Type your input

2. See the result

Running algorithm...

3. Use this algorithm

curl -X POST -d '{{input | formatInput:"curl"}}' -H 'Content-Type: application/json' -H 'Authorization: Simple YOUR_API_KEY' https://api.algorithmia.com/v1/algo/util/Html2Text/0.1.4
View cURL Docs
algo auth
# Enter API Key: YOUR_API_KEY
algo run algo://util/Html2Text/0.1.4 -d '{{input | formatInput:"cli"}}'
View CLI Docs
import com.algorithmia.*;
import com.algorithmia.algo.*;

String input = "{{input | formatInput:"java"}}";
AlgorithmiaClient client = Algorithmia.client("YOUR_API_KEY");
Algorithm algo = client.algo("algo://util/Html2Text/0.1.4");
AlgoResponse result = algo.pipeJson(input);
System.out.println(result.asJsonString());
View Java Docs
import com.algorithmia._
import com.algorithmia.algo._

val input = {{input | formatInput:"scala"}}
val client = Algorithmia.client("YOUR_API_KEY")
val algo = client.algo("algo://util/Html2Text/0.1.4")
val result = algo.pipeJson(input)
System.out.println(result.asJsonString)
View Scala Docs
var input = {{input | formatInput:"javascript"}};
Algorithmia.client("YOUR_API_KEY")
           .algo("algo://util/Html2Text/0.1.4")
           .pipe(input)
           .then(function(output) {
             console.log(output);
           });
View Javascript Docs
var input = {{input | formatInput:"javascript"}};
Algorithmia.client("YOUR_API_KEY")
           .algo("algo://util/Html2Text/0.1.4")
           .pipe(input)
           .then(function(response) {
             console.log(response.get());
           });
View NodeJS Docs
import Algorithmia

input = {{input | formatInput:"python"}}
client = Algorithmia.client('YOUR_API_KEY')
algo = client.algo('util/Html2Text/0.1.4')
print algo.pipe(input)
View Python Docs
library(algorithmia)

input <- {{input | formatInput:"r"}}
client <- getAlgorithmiaClient("YOUR_API_KEY")
algo <- client$algo("util/Html2Text/0.1.4")
result <- algo$pipe(input)$result
print(result)
View R Docs
require 'algorithmia'

input = {{input | formatInput:"ruby"}}
client = Algorithmia.client('YOUR_API_KEY')
algo = client.algo('util/Html2Text/0.1.4')
puts algo.pipe(input).result
View Ruby Docs
use algorithmia::*;

let input = {{input | formatInput:"rust"}};
let client = Algorithmia::client("YOUR_API_KEY");
let algo = client.algo('util/Html2Text/0.1.4');
let response = algo.pipe(input);
View Rust Docs
Discussion
  • {{comment.username}}