While working in regression analysis, you should be familiar with some very basic but very impactful concepts. In machine learning interviews, you can always expects questions from regression analysis. Regression analysis also develop the basic understanding of machine learning model building as we mostly start our machine learning journey from regression analysis only.
Feature store in machine learning is the concept to store features in both online and offline stores for model training and serving purposes. Feature store make sure to provide the consistency between the data used for model training and the data used during online serving to models. In other words, it guarantees that you’re serving the same data to models during training and prediction, eliminating training-prediction skew. Feast is one of the open source tools used for feature store.
Experiment tracking is the process of recording all the important components such as hyper parameters, metrics, models and artifacts like plots PNG images, files etc. Experiment tracking helps to reproduce the old results by using the stored parameters.
Statistics is a subject and a branch of mathematics that is related to all the collection, analysis, interpretation, and visualization of empirical data, and there are two major areas of statistics are descriptive statistics and inferential statistics. If we talk about, descriptive statistics are used to describe the characteristics of sample and population data (what has happened). These properties are used by inferential statistics to test hypotheses, reach conclusions, and make predictions (what can you expect).
Hypothesis testing helps us to validate the various claims made by different people in different scenario. For example if we claim that there is no significant difference between boys and girls intelligence level. So can we validate it significantly? Or can we validate that smoking causes cancer?
Decision Tree is supervised machine learning algorithm which is used for both types of problems regression (that is predicting the continuous value for future example house price, hours the match can be played given overcast condition etc…) and classification (that is classifying different objects into respective categories or classes for example given the overcast conditions match will be played or not, given image belongs to cat or dog etc…).