Machine learning

Algorithms that independently learn to make predictions and decisions after training on datasets instead of being explicitly programmed.

Sort by
3 September 2020
8 min read
Bayesian ML is a paradigm for constructing statistical models based on Bayes’ Theorem. Learn more from the experts at Algorithmia.
2 September 2020
11 min read
XGBoost is a popular library among machine learning practitioners, known for its high performance and memory efficient implementation of gradient boosted decision trees. Since training and evaluating machine learning models on Jupyter notebooks is also a popular practice, we’ve developed a step-by-step...
27 August 2020
8 min read
Neural networks, as their name implies, are computer algorithms modeled after networks of neurons in the human brain. Learn more about neural networks.
11 August 2020
5 min read
Semi-supervised learning is the type of machine learning that uses a combination of a small amount of labeled data and a large amount of unlabeled data to train models. This approach to machine learning is a combination of supervised machine learning, which uses labeled training data, and unsupervised...
6 August 2020
5 min read
Computing has the power to do some of the things that the human brain can do, thanks to advances in artificial intelligence. One of those advances is text processing, which also relates to natural language processing. This article is a deep dive into what text processing is and how it can generate value...
4 August 2020
5 min read
A data lake is a centralized repository of all an organization’s data stored in its raw format. This allows enterprises to store all their data, in its natural or raw state, in one location. This includes structured, relational data with rows and columns, semi-structured data such as CSV or XML files,...
29 July 2020
9 min read
The field of natural language processing (NLP) is concerned with the creation of machine learning methods for understanding written and verbal data. And as in any subfield of machine learning, it’s necessary to devise a technique for creating numerical representations of that data so it can be acted...
28 July 2020
6 min read
Applied machine learning is the application of machine learning to a specific data-related problem. This machine learning can involve either supervised models, meaning that there is an algorithm that improves itself on the basis of labeled training data, or unsupervised models, in which the inferences...
24 July 2020
6 min read
Data democratization, the process of allowing as many people as possible to have access to data without any bottlenecks or gatekeepers, can happen both within and between organizations. Within an organization, data democratization might mean that the IT department makes data easily and readily accessible...
21 July 2020
6 min read
One of the final (and arguably most important steps) in developing a machine learning model is evaluating its accuracy. You can’t trust a model to make good predictions about new and unknown data if it’s struggling with training data. Regression models evaluating accuracy usually means calculating...