datalogue

datalogue / html2json / 0.1.0

README.md

Overview

This algorithm can be used to change an HTML file containing a json document

Usage

Input

ParameterDescription
inputFileUriUri of the file as it is accessible on the algorithmia platform
outputFileUriUri to be outputed on the algorithmia platform

Output

The output is a file with a json document

Examples

An html like this

<!DOCTYPE html>
<html>
<head>
<title>Page Title</title>
</head>
<body>

<h1>This is a Heading</h1>
<p>This is a paragraph.</p>

</body>
</html>

Will be converted to this

{"html":{"head":{"title":{"text":"Page Title"}},"body":{"h1":{"text":"This is a Heading"},"p":{"text":"This is a paragraph."}}}}