Subset Sequence Frequency in Set

No algorithm description given

Fast Algorithm to compute the most common prefixes in a large dataset.  AKA "Starting Pattern Occurrence Frequency " Results ranked by the most common. Applications: Next letter prediction Word completion DNA sequencing Protein sequencing Computational linguistics Compression algorithms Optional parameters: minScore - result frequency cutoff [default: 2] minLength - minimum prefix length [default: 1] startsWith - fixed prefix filter, useful for predictions (not used by default) maxResults - return at most this many results (by default return all matching) Examples: { "minLength": 4, "maxResults": 10, "dataset": ["John", "William", "James", "Charles", "George", "Frank", "Joseph", ...] }  Returns the top10 Baby Name prefixes, minimum length 4, from a 20th century US Baby Names list (2.5Mb), example dataset trimmed, result: {"Mari": 1941, "Fran": 1420, "Chris": 1227, "Chri": 1227, "Juli": 1167, "Will": 1151, "Char": 1066, "Christ": 1057, "Marg": 1041, "Kath": 983} Sample input and output against a "buzzwords" list:

Tags
(no tags)

Cost Breakdown

3 cr
royalty per call
1 cr
usage per second
avg duration

Cost Calculator

API call duration (sec)
×
API calls
=
Estimated cost
per calls
for large volume discounts
For additional details on how pricing works, see Algorithmia pricing.

No permissions required

This algorithm does not require any special permissions.


To understand more about how algorithm permissions work, see the permissions documentation.

1. Type your input

2. See the result

Running algorithm...

3. Use this algorithm

curl -X POST -d '{{input | formatInput:"curl"}}' -H 'Content-Type: application/json' -H 'Authorization: Simple YOUR_API_KEY' https://api.algorithmia.com/v1/algo/CS/SubsetSequenceFrequency/0.1.0
View cURL Docs
algo auth
# Enter API Key: YOUR_API_KEY
algo run algo://CS/SubsetSequenceFrequency/0.1.0 -d '{{input | formatInput:"cli"}}'
View CLI Docs
import com.algorithmia.*;
import com.algorithmia.algo.*;

String input = "{{input | formatInput:"java"}}";
AlgorithmiaClient client = Algorithmia.client("YOUR_API_KEY");
Algorithm algo = client.algo("algo://CS/SubsetSequenceFrequency/0.1.0");
AlgoResponse result = algo.pipeJson(input);
System.out.println(result.asJsonString());
View Java Docs
import com.algorithmia._
import com.algorithmia.algo._

val input = {{input | formatInput:"scala"}}
val client = Algorithmia.client("YOUR_API_KEY")
val algo = client.algo("algo://CS/SubsetSequenceFrequency/0.1.0")
val result = algo.pipeJson(input)
System.out.println(result.asJsonString)
View Scala Docs
var input = {{input | formatInput:"javascript"}};
Algorithmia.client("YOUR_API_KEY")
           .algo("algo://CS/SubsetSequenceFrequency/0.1.0")
           .pipe(input)
           .then(function(output) {
             console.log(output);
           });
View Javascript Docs
var input = {{input | formatInput:"javascript"}};
Algorithmia.client("YOUR_API_KEY")
           .algo("algo://CS/SubsetSequenceFrequency/0.1.0")
           .pipe(input)
           .then(function(response) {
             console.log(response.get());
           });
View NodeJS Docs
import Algorithmia

input = {{input | formatInput:"python"}}
client = Algorithmia.client('YOUR_API_KEY')
algo = client.algo('CS/SubsetSequenceFrequency/0.1.0')
print algo.pipe(input)
View Python Docs
library(algorithmia)

input <- {{input | formatInput:"r"}}
client <- getAlgorithmiaClient("YOUR_API_KEY")
algo <- client$algo("CS/SubsetSequenceFrequency/0.1.0")
result <- algo$pipe(input)$result
print(result)
View R Docs
require 'algorithmia'

input = {{input | formatInput:"ruby"}}
client = Algorithmia.client('YOUR_API_KEY')
algo = client.algo('CS/SubsetSequenceFrequency/0.1.0')
puts algo.pipe(input).result
View Ruby Docs
use algorithmia::*;

let input = {{input | formatInput:"rust"}};
let client = Algorithmia::client("YOUR_API_KEY");
let algo = client.algo('CS/SubsetSequenceFrequency/0.1.0');
let response = algo.pipe(input);
View Rust Docs
Discussion
  • {{comment.username}}