A.I, Data and Software Engineering

Search results foralgorithm

K-Means vs K-Nearest neighbours quick note

petamind

These are completely different methods in machine learning. The fact that they both have the letter K in their name is a coincidence. K-means is a clustering algorithm that tries to partition a set of points into K sets (clusters) such that the points in each cluster tend to be near each other. It is unsupervised because the points have no external classification. The typical k-means...

Lasso vs Ridge vs Elastic Net – Machine learning

petamind

Lasso, Ridge, and Elastic Net are excellent methods to improve the performance of your linear model. This post will summarise the usage of these regularization techniques. Bias: Biases are the underlying assumptions that are made by data to simplify the target function. Bias does help us generalize the data better and make the model less sensitive to single data points. It also decreases the...

Understanding Latent Dirichlet Allocation (LDA)

petamind

Imagine a large law firm takes over a smaller law firm and tries to identify the documents corresponding to different types of cases such as civil or criminal cases which the smaller firm has dealt or is currently dealing with. The presumption is that the documents are not already classified by the smaller law firm. An intuitive way of identifying the documents in such situations is to look for...

Number of Islands solution

petamind

In this post, we have a look at using a queue and breath-first search algorithm to solve a Leetcode challenge. The problem is stated as follows. Given an m x n 2D binary grid grid which represents a map of '1's (land) and '0's (water), return the number of islands. An island is surrounded by water and is formed by connecting adjacent lands horizontally or vertically. You may assume all four edges...

SAMPLING Bagging vs Pasting

One way to get a diverse set of classifiers is to use very different training algorithms,as just discussed. Another approach is to use the same training algorithm for everypredictor but to train them on different random subsets of the training set. Whensampling is performed with replacement, this method is called bagging(short for bootstrap aggregating). When sampling is performed without...

A.I, Data and Software Engineering

PetaMinds focuses on developing the coolest topics in data science, A.I, and programming, and make them so digestible for everyone to learn and create amazing applications in a short time.

Categories