In real-world data, there are some instances where a particular element is absent because of various reasons, such as corrupt data, failure to load the information, or incomplete extraction. Handling the missing values is one of the greatest challenges faced by analysts because making the right decision on how to handle it generates robust data models. Let us look at different ways of imputing...
Network protocols quick notes
In the world of technology, there are vast numbers of users’ communicating with different devices in different languages. That also includes many ways in which they transmit data along with the different software they implement. Yes, we’re talking about “networking protocols” which are a set of rules that help in governing the way a particular technology will function for...
THE JOEL TEST quick note: 12 STEPS TO BETTER CODE
Recently, I came across a company’s profile and found their Joel test. Basically, it is for a better quality of software team/company. So I decided to share it here in the simplified version. The great part about it is that it takes about 3-5 minutes compared to approx 6 years to understand SEMA. The Joel Test Joel test questions The neat thing about The Joel Test is that it’s easy to get a...
Decision Tree Regression quick note
Decision tree builds regression or classification models in the form of a tree structure. It breaks down a dataset into smaller and smaller subsets while at the same time an associated decision tree is incrementally developed. The final result is a tree with decision nodes and leaf nodes. A decision node (e.g., Outlook) has two or more branches (e.g., Sunny, Overcast and Rainy), each representing...
Feature Engineering FundamentalS
The features you use influence more than everything else the result. No algorithm alone, to my knowledge, can supplement the information gain given by correct feature engineering.— Luca Massaron What is a feature and why we need engineering of it? Basically, all machine learning algorithms use some input data to create outputs. This input data comprise features, which are usually in the form...