• Around 1.2 years of professional IT experience in both product-based and service-based industries, working in data science with predictive analytics, machine learning, reporting, and ETL.
• Sound knowledge of machine learning algorithms and ETL processes (Ab Initio), along with statistical computing (R), programming (Python), and data manipulation (SQL).
• Skilled at drawing insights from large data sets using predictive analytics, text mining, R, and Python, and at generating dashboards and reports with Tableau and Excel.
• Good knowledge of Advanced Excel and SQL.
• Currently exploring machine learning with Python.
Technical Expertise:
• Languages & Scripting: Python, R, SQL
• Data Science Toolkit:
◦ Python Modules/Libraries: Jupyter Notebook, Web Scraping, Pandas, Matplotlib.
◦ Machine Learning (R, Python): Simple and Multiple Linear Regression, Logistic Regression, Classification using KNN, Decision Tree, Random Forest, Naive Bayes, Support Vector Machine, Association Rule Mining using Apriori, Unsupervised Learning with PCA, Factor analysis, K-Means clustering, etc.
◦ Statistics: Statistical understanding of models, model interpretation, overfitting, underfitting, properties of distributions, and statistical tests, including their proper usage and real-world advantages/drawbacks.
◦ Reporting: Matplotlib, ggplot, Cognos.
• OS and Other Tools:
◦ Windows, Linux, PuTTY, Rally
◦ PyCharm, RStudio, Anaconda, Spyder, Jupyter, Ab Initio, SQL Developer