You are on page 1of 3

Expertise

 Data and Quantitative Analysis  Big Data Queries and Interpretation


 Decision Analytics  Data Mining and Visualization Tools
 Predictive Modeling  Machine Learning Algorithms
 Data-Driven Personalization  Business Intelligence (BI)
 KPI Dashboards and BPI Plans  Research, Reports and Forecasts

 Developed intricate algorithms based on deep-dive statistical analysis and


predictive data modeling that were used to deepen relationships, strengthen
longevity and personalize interactions with customers.
 Analyzed and processed complex data sets using advanced querying,
visualization and analytics tools.
 Identified, measured and recommended improvement strategies for KPIs across
all business areas.

Data and Analytics Tools/Languages: Spark, SparkR, R, Python, Scala, Hive, SQL,
SAS, Tableau, SPSS, Hadoop, Stata, Google Analytics, Amazon Web Services
Publications and Presentations: Available at mariatannerphdportfolio.com

Have a Good knowledge of Python Programming.(2 years).


Worked on predictive analytics models, scripting models, Analysis of Large Data sets,
Scraping.
Written various predictive algorithm and analytics algorithm using
python(pandas,numpy) and perl(regular expressions).

Intemediate knowledge of :
Apache spark,
Scala
Spark SQL
Python Django.
Python Flask.
Spark Batch Processing.

have a 2 Years of experience in Data Scientist - Linear Regression, Logistic


Regression, Clustering, Time Series Analysis, Text Mining, Predictive Modeling, Model
& Model Implementation and Machine learning Techniques using with Statistical
Analysis tools R and Python. Statistical Analysis: Hypothesis Testing, Correlation
Analysis, Missing Data Imputation, Regression modelling &validation - Multivariate
Linear Regression, Logistic Regression. Data Mining: K-Means Clustering, Association
Rules, Decision Trees, Random Forest, Support Vector Machines, Naive Bayes.
Practitioner of Machine learning classification and clustering algorithms. Forecasting
Analytics: Time Series analysis using R , Time Series analysis using Regression.
Analyzing the Requirements and Datasets provided by the client for large volume of
data.
A professional with 3+ years experience in Advanced Analytics and Analytical
Projects.Have experience working with Ecomerece,Finance,Manufacturing and digital
Domains. Developed machine learning algorithms and innovative analytics tools using
Machine learning and Analytical thinking. Built spyders and web scrapers to scrape
many financial, eCommerce and SEC sites and developed tools on the data scraped.
With the experience in NLP and text analytics drawn sentiments from news articles and
10K filings of SEC. Popularly known for out of box thinking and leadership qualities.
Handled team of data scientist in delivering projects for Investment firms.

Machine Learning skills: Linear,Logistic,SVM,Random forest,ANN,KMeans,KNN,Nueral


Networks,Tensor Flows and TF learn.Have expertise in Web Scraping,Supervised
,Unsupervised learning and Natural Language processing. Have worked in developing
complete end to end solutions using machine learning . Right from Data gathering,Data
Cleansing,Variable selection and coming out with best subset keeping in mind the curse
of dimensionality. Worked immensely on NLTK,TKinter,Pyqt,Pandas,Numpy,Beautiful
Soup,RE,Newspaper and many API's

Developed sentiment analysis tools on Twitter,Facebook and Glass door data

Proficient in Python,R ,Matlab and VBA

In my work as a Data Scientist, I've had to use a lot of clunky and counter intuitive
algorithms. Curious as to why these machine learning techniques worked the way they
do, I started teaching myself Statistics, Programming and Business. Almost immediately,
I knew this is what I wanted to do with my life. And I love to learn more!

My day at work starts with data extraction from database, cleaning the data , drawing
business insights and developing predictive models. Coding in Python, R has became a
daily routine.

“He who would search for Pearls must dive below"


Often one need to go deeper than the surface-level information to uncover the valuable
insights hidden beneath. I am sure that my curiosity and experience would serve as my
gills while I dive in search of pearls.

Want to find out more about me and my work. Reach me @puja.gangarapu@gmail.com.


I am more than happy to share.

Advanced Analytics professional with over a decade of experience with Data mining
projects from development to execution and from requirement gathering to making
deliverable a solid part of an organization’s strategy.
The main tools include SPSS, SAS, KXEN and WEKA. R has been used in algorithm
design during educational research.
Apart from above, expertise in TERADATA SQL and TERADATA Warehouse miner for
building end to end Analytical datasets and models.
Leading a team of professionals to drive analytics within and outside the organization
supporting data insight initiatives to facilitate Banks, HORECAs and Brands with
Telecom Data.
Built frameworks for advanced analytics funneling in Event stream processing tools for
real time decision making like SAS RTDM, IBM Info- streams, IBM UNICA.
Growing the core by giving not only advanced analytics insights but coupling them with
business analysis and visual anaytics in the form of dashboards and drill down
interfaces like Tableau.

You might also like