University of California, Berkeley
Berkeley, CA

Masters in Information and Data Science

Relevant Coursework
Research Design, Applied Machine Learning, Data Engineering

Expected: August 2020

University of California, Berkeley
Berkeley, CA

Bachelor of Arts in Computer Science
GPA: 3.81

Relevant Coursework
Data Structures, Algorithms, Database Systems, Artificial Intelligence, Operating Systems, Data Science, Probability for Data Science

Graduated May 2019


Redmond, WA

Software Engineering Intern

I worked as a Production Infrastructure Engineer under the Azure Log Analytics team. My project revolved around increasing the scalability and the reliability of our Telemetry Pipeline by integrating a shared redis cache layer which decreases the number of expensive outbound data center calls by 80% in core pipeline components. Additionally, the redis integration drastically reduced by 50% the time it takes to start up new instances of our service as well as restart existing ones. Due to the success of the redis in our pipeline, I packaged and published a generalized redis client as a NuGET package for an easy integration of redis in other services.

May 2019 - August 2019

Palo Alto, CA

Software Engineering Intern

I worked on creating a question answering system which can parse human language and optimize workplace productivity by answering questions about VMware's organizational hierarchy structure. To store employee data, I designed and constructed a knowledge graph using Neo4j's graph database system that connects employee entities through hierarchical relationships. To carry out the language processing, I used RASA NLU with a Spacy backend which was able to interpret english queries and convert them into database queries. Finally, I deployed the entire project using Flask and created a Docker image for future company use. This project won 5th place in the annual company wide intern competition.

May 2018 - August 2018

UC Berkeley EECS Department
Berkeley, CA

Undergraduate Student Instructor

I served as an UGSI for CS61B, a course on Data Structures and Algorithms. I was responsible for teaching more than 30 students weekly in discussion, labs, office hour sections and participating in weekly staff meetings. I also assisted in proctoring and grading exams as well as creating new course related worksheets to test students understanding of the lecture material. Cummulatively have spent over 330 hours teaching this course.

January 2018 - May 2019s

Mobile Developers of Berkeley
Berkeley, CA

Senior IOS Developer

As part of Mobile Developers of Berkeley, I took part in a rigorous IOS training program that involved programming and designing 4 deployable applications in 5 weeks. In the later half of the same semester, I worked with a team with 2 other developers to research, develop, design, and deploy a full scale application to the Apple App Store. Currently working on a contract for JASStek Inc. to create an image recognition app which can simulate lip movements on a marine image.

January 2018 - May 2019

Center for Community Innovation
Berkeley, CA

Data Science Researcher

I worked on a research team at the Center of Community Innovation to conduct research on the travel pattern information released by the Emissions Application. Interacting with the application's MongoDB database, I was able to pull out the necessary user data needed and wrote helpful python scripts to aid in the data cleaning process. I utilized Pandas and Searborn to clean, analyze, and interpret trends in our data set.

July 2017 - December 2017s


view my projects here


Java, Python, Swift, SQL, C, HTML, CSS

Git, Neo4j, Firebase, Pandas, Seaborn

© Shubham Gupta 2018