Posts by James Sutton
8 October 2020
7 min read
In this blog post, we discuss Spark and Spark Pipelines—and how you might be able to export a critical component from your Spark project to Algorithmia by using the MLeap model interchange format and runtime.
21 April 2020
5 min read
In February we announced our support of GitHub as a repository storage system for any algorithm on Algorithmia. This is big news and it unlocks a lot of potential for developers. In this post, we’ll show you how integrating with GitHub can enable new workflows and automation systems that can magnify...
11 March 2020
8 min read
Natural language processing has been one of the most prominent and visible uses of machine learning capabilities in recent years. From the basics of recurrent neural network architectures that were able to detect the first named entity pairings, to today, when transformers are able to look at an entire...
30 July 2018
6 min read
We recently went into detail about the Document Classifier algorithm in our spotlight. That’s all fine and good, but it’s not immediately clear what you can do with it. In this post, we’ll focus on potential use cases. We’ll start with a quick refresher on what this algorithm...
10 July 2018
12 min read
As many applied ML practitioners know, productionizing ML tools can be deceptively difficult. At Algorithmia we’re always striving to make our algorithms the best in class, and we’ve recently made a series of performance and UX changes to our Document Classifier algorithm, and put work towards...
28 November 2017
7 min read
Let’s play a game: can you tell the difference between these two sentences? “Most of the time, travellers worry about their luggage.” “Most of the time travellers worry about their luggage.” Whoa, remove the comma and all of a sudden we’re having an entirely different...
26 October 2017
40 min read
The new Open Images dataset gives us everything we need to train computer vision models, and just happens to be perfect for a demo! TensorFlow’s Object Detection API and its ability to handle large volumes of data make it a perfect choice, so let’s jump right in… Open Images is a...
21 September 2017
3 min read
This is easy to understand, right? How about this? A bit harder? Are you able to decipher this one at all? (Image courtesy of Faris Algosaibi.) The first example can be easily recognized by most character recognition algorithms. However, as your text gets progressively more complex, this seemingly simple task...
7 August 2017
5 min read
Figuring out the meaning of a document was once a very hard problem for computers to solve… even for humans, understanding the complexity of natural language can be tricky! Fortunately, there are some great tools that can help address those concerns. The Document Classifier turns your existing...
1 August 2017
2 min read
Automatically downloading video content from different types of web sources can be tricky. Not every website hosts files the same way, and sometimes hosting providers try to prevent people from programmatically downloading stuff. Thankfully, we have an algorithm for that! What is the Smart Video Downloader? The...