Author Image

Hi, I am Pragati

Currently working as Data Analyst and contributing to the enterprise in automating tasks through machine learning, building data pipelines, creating effective business visualizations. I completed my MS in data science from the Illinois Institute of Technology. Areas of specialization during my master's program are: Machine Learning, Database Organization, Applied Statistics, Data Preparation and Analysis and Statistical learning. I worked as a summer Data Scientist trainee at CCC Solutions under my Data Science practicum training and as a Machine learning Engineer intern at Visteon Corporation (India) during my undergraduate degree.

I am always open to interesting and engaging conversations.

My personal website: https://pragati2.github.io/PragatiKhekale.github.io/

Leadership
Team Work
Curiousity
Hard Working
Solution Oriented

Experiences

1
Data Analyst
Neebal Tech

August 2023 - present, Dallas, Tx

Responsibilities:
  • Currently working on developing depth prediction for unlocatable data points using ArcGIS pro and KNN Regression and Deep learningin python, to resolve the unlocatable tickets with an estimated x,y, z coordinates for mapping
  • Leading a project to create a Data Warehouse and organize the team’s data
  • Lead and created a KPI tracking initiative to track integrity progress across groups under pipeline integrity using tableauvisualizations
  • Developed Enterprise wide Annual Report (Phmsa reporting) in Tableau using Scatter Plots, Geographic Map, Pie Charts andBar Charts and Density Chart
  • Published a custom GIS map for reporting purpose using ArcGIS
  • Developed Tableau data visualization using Scatter Plots, Geographic Map, Pie Charts and Bar Charts and Density Chart. Evaluated data profiling, cleansing, integration and extraction tools

ACMW Tutor
Illinois Institute of Technology

September 2021 - December 2022, Chicago, Illinois

ACM Women Tutor at Illinois Institute of Technology for Machine Learning, Statistical Learning

Responsibilities:
  • Tutored and guided student throughout execution of machine learning projects.
  • Helped in providing analytical insights through the given results: Generating analytical insights from 5000 data points, to interpret a trend in covid based on travel history of the sample dataset.
2

3
Data Scientist Intern
CCC Intelligent Solutions

May 2022 - August 2022, Chicago, Illiois (remote)

Responsibilities:
  • Worked with implementing machine learning model, with tabular dataset to test the SISA architecture
  • Evaluated SISA architecture and obtain improved model where average accuracy is maintained for modified training data.
  • Achieved 96% accuracy on CNN model, improved the model accuracy by 5.56% through modifying SISA parameters
  • Contributed towards model assessment to generate analytical insights in 6 scenarios for 250,000 data points
  • Performed comparative data analysis for performance of SISA model on different shards to evaluate model in real time using Python

5
Machine Learning Engineering Intern
Visteon Corporation

June 2019 - September 2019, Pune, Maharashtra, India

Machine Learning (ML) Engineering Intern at Visteon Corporation, ADAS team

Responsibilities:
  • Co-ordinated with ADAS team for interior sensing project for PoseNet activity detection model
  • Developed and modified model to generate alert for user by pose estimation model (computer vision application)
  • Evaluated OpenMax OSDN model achieved object detection and improved open set deep networks classifier
  • Technical Skills utilized: Python, C++, C, CMake, keras, TensorFlow, pandas, OpenCV

Education

January 2021 - December 2022
MS in Data Science
Courses Taken
  • Data Preparation and Analysis
  • Machine Learning
  • Database Organization
  • Advanced Database Organization
  • Pribability and Statistics
  • Statistical Learning
  • Applied Statistics
  • Monte-Carlo methods in Finance
  • Project Management
August 2016 - May 2020
Bachelors of Engineering/Technology in Electronics and Telecommunications
Relevant Courses Taken
  • Introduction to Machine Learning
  • Artificial Intelligence
  • Digital Image and Video Processing
  • Computer Networks and Security
  • Data Structures and Algorithms
  • Object Oriented Programming

Projects

Machine Unlearning for Data Privacy
Developer May 2022 - August 2022

  • Constructed data privacy in SISA architecture using machine unlearning algorithms.
  • Achieved 96% accuracy on the model and analyzed the results on different machine learning models.
  • Generated comparative analysis for performance of SISA model on different shards to evaluate model in real time.
  • Black Friday Prediction
    Developer August 2022 - December 2022

    Used a tabular dataset for black friday sales prediction collected from one store. Trained Neural Network with three different regression models- a) Decision Tree regressor, b) Random Forest Regressor, and c) Gradient Boosting Regressor. Performed preprocessing of data by cleaning the data, performed indivisual variable analysis and multi variate analysis. Trained the data using three regressors and evaluated the model using RMSE score for the three models.

    OpenMax Open Set Deep Networks
    Developer August 2021 - December 2021

    Implemented transfer learning model for increasing model accuracy for object detection using improved activation function. This improved model is implemented using open set deep networks, where the open set classifier is subjected to openMax activation function. First model for the transfer learning approach was trained on CIFAR-10 dataset to obtain weights to be used on MNIST dataset. Training of the model was performed using tensorflow deep networks with depth of 4 convolution layers. This model resulted in reducing false positives given during object detection.

    Data Analysis for Effect of Unemployment on Box Office Collection using R-Analysis
    Developer May 2022 - August 2022

    This project aims to measure how reviews and ratings of movies released in theaters relate to their sales at the box office. The analysis addresses how the general sentiments of the public towards specific theatrical releases relate to box office results could bring substantial benefits to the movie industry. Four different datasets were used for the analytical prediction from the data collected were curatted using: 1)The Movies Dataset 2)IMDB movies dataset 3)US Unemployment dataset 4)Bureau of Labor Statistics Unemployment rates. Data was preprocessed and cleeaned for all the given datasets in order to combine them together. The combined data was analyzed to perform bivariate analysis and for Modelling. Modelling was implemented using both Supervised (Regression and Classification) and Unsupervised (K-Mean clustering) models.

    Airline-Flight Booking Application
    Developer January 2021 - May 2021

    Used PostgreSQL to create and manage dataset for airline booking information of costumers in this project. Also used java based web appplication for the front end implementation, included 20 rows of data in the PostgreSQL to store and process.

    Big Data AWS ML analysi
    Developer Jan 2021 - May 2021

    Analyzed different services provided by AWS for implementing machine learning models Analysis using different AWS technologies (S3, EC2). Designed and analyzed classifier machine learning models like: Regression, Decision Tree, SVM, Gradient Boost.

    Oral Cancer Detection using Deep Learning
    Developer August 2019 - August 2020

    This project originated from an idea aimed at real world problem solving using Computer Vision. There were not many machine learning applications targeting oral cancer. Vision for the project: to make an accessible application to generate alerts for users to consult and get tested for oral cancer. Performed tasks like data collection, labeling, data cleaning, preprocessing of the images for the model. Multiple approaches were tested for this problem such as: Transfer learning, CNN. From testing of different approaches for results based on parameters like accuracy, error rate VGG-16 architecture was successfully implemented. Results from the model is a binary classification (If the given image is cancerous or not).

    Black and White Space Detection
    Developer January 2019 - May 2019

    This is a hardware based device which is used in detecting white spaces where the applications range from finding the number of words on a paper to detecting white spaces in chess boards. This was arduino based project, objective was to detect black & white space in between characters & symbols.

    Certifications

    Udemy Deep Learning Pre requisite: Numpy stack
    Udemy Course September 2020 - November 2020

    This is a begining to start with Deep Learning. Courses:

    1. Foundations: Algebra, Gaussian Distribution, Python coding - Certification
    2. Fundamental machine learning concepts in python: Numpy, Pandas, Matplolib, SciPy - Certification
    3. Understand supervised machine learning (classification and regression) with real-world examples using Scikit-Learn - Certification
    4. Understand and code using the Numpy stack - Certification
    5. Make use of Numpy, Scipy, Matplotlib, and Pandas to implement numerical algorithms - Certification
    6. Understand the pros and cons of various machine learning models, including Deep Learning, Decision Trees, Random Forest, Linear Regression, Boosting, and More! - Certification

    Neural Network using Python for Beginners
    Udemy Course August 2020 - September 2020

    Course for Basic conceptual understanding of ANN

    1. Identify the business problem which can be solved using Neural network Models - Certification
    2. clear understanding of Advanced Neural network concepts such as Gradient Descent, forward and Backward Propagation etc. - Certification
    3. Create Neural network models in Python using Keras and Tensorflow libraries and analyze their results. - Certification

    Awards

    All India Merit Scholarship
    2013
    Rank 345,All India Open Mathematics Scholarship Examination (IPM), India.
    Merit Scholarship (high school)
    2016
    Top 10% in All India Central Board of Secondary Education, Senior Secondary Curriculum, 2014

    Activities

    Jan'21 - Dec'22
    Association for Computing Machinery (ACM), Illinois Institute of Technology, Chicago, IL.
    IEEE Student Branch, Pune Institute of Computer Technology, Pune, India.
    Volunteered for Event management for Paper presentation for Credenz 2018. Part of backend team for Credenz game (Linja).

    Organizations

    Student Member