Word vectors – also called word embeddings – are mathematical descriptions of individual words such that words that appear frequently together in the language will have similar values. In this way we can mathematically derive context. As mentioned above, the word vector for “lion” will be closer in value to “cat” than to “dandelion”.
Machine Learning algorithms don’t understand the textual data rather it understand only numerical data. So the problem is how to convert the textual data to the numerical features and further pass these numerical features to the machine learning algorithms.
As we all know that the raw text stored in some dump repository contains a lot of meaningful information. And in today’s fast changing world, it becomes essential to consider data driven decision than fully rely on experience driven decision.
Data Visualization is one of the important activity we perform when doing Exploratory Data Analysis. It helps in preparing business reports, visual dashboards, story telling etc important tasks. In this post I have explained how to ask questions from the data and in return get the self explanatory graphs. In this You will learn the use of various python libraries like plotly, matplotlib, seaborn, squarify etc to plot those graphs.
In today’s world artificial intelligence (AI) is making noise in every sector allowing computers to learn from and recommend the actions based on the previously collected data. Now a days, Human Resource department have no longer a manual work, AI technology offers the significant opportunities to improve the HR functions such as recruiting and talent acquisition, payroll, reporting, access policies and procedures etc. AI will help HR executives to enhance the overall employee experience this will give HR executive more time in budgeting, planning and making accurate decision on people management.
Data science and big data are no longer strange terms that are limited to techie vocabulary. In fact, the continuous expansion of the digital world has made data crucial for businesses to grow and succeed. This is why both terms have become more commonly used in industries other than IT.
Sentence Segmentation or Sentence Tokenization is the process of identifying different sentences among group of words. Spacy library designed for Natural Language Processing, perform the sentence segmentation with much higher accuracy. Spacy provides different models for different languages. In this post we’ll learn how sentence segmentation works, and how to set user defined segmentation rules.