About me
Aspiring data scientist and data analyst with a solid foundation in mathematics and statistics, driven by a passion for uncovering patterns and discovering hidden insights in data. I thrive on solving complex problems and finding creative solutions that make a meaningful impact. Beyond my analytical pursuits, I’m a trained Indian classical dancer and a music enthusiast currently learning the ukulele. These creative outlets have taught me discipline, precision, and the beauty of patterns—not just in numbers but also in rhythm and movement. I believe my unique blend of analytical skills and artistic perspective allows me to approach challenges with both logic and creativity, making me adaptable and innovative in the ever-evolving field of data science.
Technical Skills:
- Data Visualization Tools and Frameworks: Microsoft Power BI, Tableau, MS Excel, Streamlit
- Databases: MySQL, PostgreSQL, Neo4j, MongoDB, Cassandra
- Programming Languages: Python, R, SQL
- Machine Learning Techniques: K-Nearest Neighbors (KNN), Naive Bayes, Decision Trees, Regression, K-Means Clustering, Gradient Descent
- Libraries/Frameworks: NumPy, Pandas, Matplotlib, Seaborn, Scikit-learn, SciPy, TensorFlow, Keras, PyTorch, Statsmodels, Plotly
- Development Platforms: Anaconda, Jupyter Notebook, PyCharm, VS Code, GitHub, Google Colab
Education
University of California, Irvine
Master of Data Science
GPA: 3.95
During my master's, I had the opportunity to integrate my background in mathematics and statistics with the principles of computer science. I gained a deeper understanding of how statistics applies in real-world scenarios through courses in Linear Regression, Bayesian Statistics, and Probability Theory. To strengthen my computer science foundation, I took classes in Data Structures, Algorithms, and Database Management Systems. Additionally, courses like Big Data Management, Machine Learning, and Probabilistic Models introduced me to new technology and kept me updated on current developments in the AI field.
MIT World Peace University, Pune, India
Master of Science, Statistics
GPA: 3.94
During my undergraduate studies, I found statistics particularly fascinating and wanted to explore it in greater depth. In this program I took courses such as Asymptotic Inference, Clinical Trials, Time Series Analysis, and Hypothesis Testing, which further fueled my interest in the field. I also had the opportunity to take an introductory course in Machine Learning, which I found incredibly exciting. This experience sparked my decision to pursue a career in data science, where I could combine my love for mathematics, statistics, and problem-solving to derive meaningful insights from data.
I received the Dr. Vishwanath Karad Merit Scholarship for the academic year 2022-23.
Savitribai Phule Pune University, Fergusson College, Pune, India
Bachelor of Science, Mathematics
GPA: 3.93
I have been passionate about mathematics for as long as I can remember. I’ve always loved playing with numbers and exploring the fundamental principles that explain how things work. This curiosity led me to pursue a bachelor's degree in Mathematics, with minors in Physics and Statistics. During my undergraduate studies, I delved into topics such as Real Analysis, Complex Analysis, Combinatorics, Linear Algebra, and Differential Equations. These subjects not only deepened my understanding of mathematics but also honed my logical reasoning and critical thinking skills.
Experience
Innopiphany
Data Analyst | Sept 2024 - Dec 2024
Innopiphany is a data-driven consulting firm specializing in innovative forecasting and strategic solutions for the pharmaceutical and healthcare industries.
My capstone team partnered with Innopiphany to build a forecasting tool that helps early-stage pharmaceutical companies anticipate government spending on their upcoming drugs.
- Integrated openFDA and financial datasets via API calls using Python (Pandas, NumPy) for efficient data retrieval and processing.
- Conducted exploratory data analysis (EDA) with Seaborn and Matplotlib to identify key predictors and uncover top drug candidates in the market.
- Designed a forecasting method leveraging analog drug data, utilizing curve fitting with moving averages to predict government spending on newly launched drugs.
- Collaborated on the development of a user-friendly, web-based UI using Streamlit for interactive data visualization and informed decision-making. UI Demo
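A minimal sketch of the moving-average smoothing idea mentioned in the forecasting bullet, assuming a simple trailing window. The function name `moving_average`, the window size, and the spending figures are illustrative placeholders and are not taken from the project code or its data.

```python
import numpy as np

def moving_average(values, window):
    """Trailing moving average; returns len(values) - window + 1 points."""
    values = np.asarray(values, dtype=float)
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    # Equal weights over the window, applied only where the window fully fits.
    kernel = np.ones(window) / window
    return np.convolve(values, kernel, mode="valid")

# Hypothetical yearly spending for one analog drug (made-up numbers).
analog_spending = [10.0, 14.0, 18.0, 25.0, 30.0, 32.0]
smoothed = moving_average(analog_spending, window=3)
print(smoothed)
```

A smoothed analog curve like this could then serve as the shape that a new drug's early spending is compared against; how the curves are scaled and matched is a separate modeling choice not shown here.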
UCI School of Medicine
Data Analyst
The goal of this project was to develop an automated voice classification system that classifies speech as cleft or non-cleft. Such a system would reduce reliance on manual assessments by clinicians, leading to faster diagnosis and better access to treatment.
- Implemented an audio processing pipeline in Python using Parselmouth to extract key audio features from speech samples.
- Developed and trained an LSTM-based neural network (SpeechLSTM) on the extracted features to classify cleft and non-cleft speech.
- Evaluated the model’s classification performance using accuracy (58.50%) and Binary Cross-Entropy Loss (0.6759).
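For reference, the two evaluation metrics named above can be computed as in the sketch below. It assumes predicted probabilities for the positive class and a 0.5 decision threshold; the labels and probabilities are toy values, not outputs of the project's model.

```python
import numpy as np

def binary_cross_entropy(y_true, y_prob, eps=1e-12):
    """Mean binary cross-entropy between 0/1 labels and predicted probabilities."""
    y_true = np.asarray(y_true, dtype=float)
    # Clip probabilities so log(0) never occurs.
    y_prob = np.clip(np.asarray(y_prob, dtype=float), eps, 1.0 - eps)
    losses = y_true * np.log(y_prob) + (1.0 - y_true) * np.log(1.0 - y_prob)
    return float(-np.mean(losses))

def accuracy(y_true, y_prob, threshold=0.5):
    """Fraction of samples whose thresholded prediction matches the label."""
    y_pred = (np.asarray(y_prob, dtype=float) >= threshold).astype(int)
    return float(np.mean(y_pred == np.asarray(y_true)))

# Toy labels and probabilities (illustrative only).
y_true = [1, 0, 1, 0]
y_prob = [0.9, 0.2, 0.6, 0.7]

bce = binary_cross_entropy(y_true, y_prob)
acc = accuracy(y_true, y_prob)
print(f"accuracy={acc:.2f}, bce={bce:.4f}")
```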
Karkinos Healthcare Pvt. Ltd.
Data Analyst Intern | Feb 2023 - Jul 2023
Karkinos Healthcare is an oncology-focused health-tech company dedicated to improving cancer care through early detection, advanced diagnostics, and patient-centric solutions.
- Led data cleaning efforts to identify issues in data collection for timely medical interventions based on cancer risk scores.
- Created dashboards in Excel (pivot tables) and Power BI to make patient data from multiple states in India easily accessible.
- Conducted data sampling for quality assurance in tele-screening, identifying outliers to enhance cancer care standards.
- Volunteered at medical camps, collecting pre-cancer screening data from over 100 patients in rural communities.
Paathshala Education
Educator and Academic Content Creator | Dec 2020 - Jul 2021
- Tutored over 20 high school students in mathematics through personalized one-on-one sessions.
- Designed tailored teaching strategies for each student, focusing on their individual strengths and addressing areas for improvement.
- Created over 300 simplified tutorial solutions to improve comprehension and boost academic performance for asynchronous learning during lockdown.
Bilvaani School of Dance and Movement-Based Learning
Trainer | Feb 2020 - Feb 2023
- Trained a batch of 15 students, aged 7 and above, in Kathak for over a year, fostering discipline and creativity through classical dance.
- Assisted elementary school students in understanding mathematical concepts by incorporating dance, movements, and rhythm, effectively addressing learning loss during the lockdown.
- Coordinated a stage performance featuring 20+ children, showcasing Kathak recitals and movement-based compositions that creatively illustrated concepts in math, history, geography, and languages.
Projects
Academic Performance Analysis with a Regression Model using Python
This project analyzes student performance data. The goal is to find patterns that help identify students at risk of underperforming, so that targeted interventions and support strategies can be developed to improve their academic achievement.
Sales Performance Analytics Dashboard using Power BI
Hierarchical Analysis of Power Consumption Data using R
Landed Cost Analysis & Tariff Optimization
Online Retail Sales Analysis using SQL and Power BI
This project involved cleaning, transforming, and analyzing e-commerce sales data to uncover business insights using SQL for data querying and Power BI for interactive dashboard creation. The dataset included over 500,000 transactions across multiple countries.
Predicting Patient Treatment Continuation using Python
This project aimed to develop and evaluate multiple machine learning models to predict whether a decision regarding a proposed health plan would be upheld or overturned. We explored different models like Random Forest, Decision Tree, Multi-Layer Perceptron, Logistic Regression, and XGBoost.