In today’s world we generate large amounts of data every second. While tweeting, chatting, writing or even speaking, we are creating a corpus of data. Most of this data is textual and unstructured, so to make it understandable to a computer we need to process it. NLP techniques help us process this data and extract useful insights from it.
Statistics is the science of collecting, organizing, presenting, analyzing and interpreting data. It is one of the most important disciplines for gaining deeper insight into data. Statistical analysis is used to manipulate, summarize and investigate data so that useful information can be obtained.
Takeaways from this post:
- Types of Statistics: Descriptive vs Inferential
- Basic terminology like Population vs Sample
- Types of Variables: Numerical vs Categorical
- Measures of central tendency: Mean, Median and Mode and their specific use cases
- Measures of dispersion/spread: Variance, standard deviation, etc.
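As a small illustration of the measures listed above, here is a minimal sketch using Python's standard `statistics` module. The `sample` values are made up for the example and do not come from the post.

```python
import statistics

# Hypothetical sample of observations (illustrative values only).
sample = [2, 4, 4, 5, 7, 9, 11]

# Measures of central tendency.
mean = statistics.mean(sample)      # arithmetic average
median = statistics.median(sample)  # middle value of the sorted data
mode = statistics.mode(sample)      # most frequent value

# Measures of dispersion, treating the data as a sample (divides by n - 1).
sample_variance = statistics.variance(sample)
sample_std = statistics.stdev(sample)

# The same data treated as a whole population (divides by n).
population_variance = statistics.pvariance(sample)

print(mean, median, mode, sample_variance, sample_std, population_variance)
```

The difference between `variance` and `pvariance` mirrors the population vs sample distinction: the sample version divides by n - 1 rather than n.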
The coefficient of determination is a direct indicator of how well a regression model fits the data. In more technical terms, the coefficient of determination measures the proportion of the variance in the response variable ‘y’ that can be predicted from the predictor variable ‘x’. It is one of the most common ways to measure the strength of a regression model. Continue reading “What is the Coefficient of Determination | R Square”
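The definition above can be sketched in a few lines of Python. This assumes the usual formula R² = 1 - SS_res / SS_tot; the function name `r_squared` and the sample values are hypothetical, chosen only for illustration.

```python
def r_squared(y_true, y_pred):
    """Proportion of the variance in y_true explained by the predictions."""
    mean_y = sum(y_true) / len(y_true)
    # Residual sum of squares: variation the model fails to explain.
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    # Total sum of squares: variation of y around its own mean.
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1 - ss_res / ss_tot


# Hypothetical observed values and model predictions.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.5, 7.5, 8.5]

r2 = r_squared(y_true, y_pred)
print(round(r2, 4))
```

A value close to 1 means the predictions account for most of the variation in `y`, while a value near 0 means the model does little better than always predicting the mean.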
Storytelling, or presenting insights, is the most important part of data analytics. It is the selling point of all your hard work. It doesn’t matter how much effort you have put into developing an analytic model if you are unable to get the attention of the target audience. In this article, my focus is on how we can use beautiful graphs to show insights about the employee attrition rate from the IBM HR Attrition data. After all, a picture is worth a thousand words. Continue reading “Employee Attrition Rate Analysis – Insights from IBM HR Data”
In any business there are some easy-to-measure variables, such as age, gender, income and education level, and some difficult-to-measure variables, such as the amount of loan to give, the number of days a patient will stay in the hospital, or the price of a house after 10 years. Regression is the technique that enables you to estimate difficult-to-measure variables with the help of easy-to-measure variables. Continue reading “What is Linear Regression? Part:2”
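To make the idea concrete, here is a minimal sketch of simple linear regression fitted by ordinary least squares in plain Python. The helper `fit_line` and the income and loan figures are hypothetical, invented for this example rather than taken from the post.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Co-variation of x and y, and variation of x, around their means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: income is the easy-to-measure variable,
# loan amount is the difficult-to-measure one (both in thousands).
income = [30, 40, 50, 60, 70]
loan = [65, 85, 105, 125, 145]

slope, intercept = fit_line(income, loan)

# Estimate the loan amount for a new applicant with an income of 55.
predicted = slope * 55 + intercept
print(slope, intercept, predicted)
```

Real data will not fall exactly on a line as these made-up numbers do, so in practice the fitted line is an estimate whose quality should be checked.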