-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.html
195 lines (181 loc) · 11 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<title>Shiv Munagala</title>
<link rel="stylesheet" href="style.css">
<!-- Font Awesome Icons -->
<link rel="stylesheet" href="https://pro.fontawesome.com/releases/v5.10.0/css/all.css" />
</head>
<body>
<div class="container">
<!-- Navigation -->
<div class="nav">
<div id="title"><b>Shiv Munagala</b></div>
<div class="nav-content">
<ul>
<li><a href="mailto:[email protected]" target="_blank"><i class="fas fa-solid fa-envelope fa-2x"
alt="Email"></i></a></li>
<li><a href="https://www.linkedin.com/in/shiv-munagala/" target="_blank"><i
class="fab fa-linkedin fa-2x" alt="LinkedIn"></i></a></li>
<li><a href="https://github.com/ShivMunagala" target="_blank"><i class="fab fa-github fa-2x"
alt="GitHub"></i></a></li>
</ul>
</div>
</div>
<!-- Introduction -->
<p class="intro-text">
Data Scientist. Mathematics Master's (MMath) graduate from the University of Oxford.
</p>
<!-- Experience Section -->
<section id="Experience">
<h2 class="header">Experience</h2>
<subsection id="Remunerated">
<h3 class="header">Remunerated</h3>
<div>
<!-- Ocado Technology -->
<p><b>(August 2024 - Present) Data Scientist - <a href="https://www.ocadogroup.com">Ocado
Technology</a></b></p>
<p>Working on the Automated Storage & Retrieval System team using data from robotics sensors.</p>
<!-- BIOStress -->
<p><b>(June 2022 - August 2024) Data Scientist - <a href="https://biostress.com">BIOStress</a></b>
</p>
<p>First full-time data science hire at an early stage startup, responsible for all the models and
data pipeline.</p>
<ul>
<li> Developed a novel stress detection algorithm with better accuracy, robustness, and
generalisability than the state-of-the-art by using careful feature/model selection and
signal pre-processing, leading to a patent submission</li>
<li>Designed scientific trial methodology for a study to be run in collaboration with the
University of Bath to evaluate performance of my model</li>
<li>Deployed model to cloud and working in accordance to ISO 27001 principles</li>
<li>Experience with multimodal data; time-series, free text, and standardised tests</li>
<li>Managed an intern over 3 months, provided guidance and set projects to help develop his data
science skills.</li>
</ul>
<!-- Oxehealth -->
<p><b>(July 2021 - September 2021) Research Intern - <a
href="https://www.oxehealth.com">Oxehealth</a></b></p>
<p>10 week internship at a vision-based medical device company.</p>
<ul>
<li>Gave insight to a new product area for the company (detecting Obstructive Sleep Apnea) by
analysing medical datasets, creating a module to handle polysomnography data, and training
time-series classifier models</li>
<li>Improved algorithm performance by building PySpark tools to audit large amounts of data
leading to a scaleable way of identifying misclassifications</li>
<li>Experience working with sensitive medical data from care homes and NHS mental health trusts.
</li>
</ul>
</div>
</subsection>
<subsection id="Volunteering">
<h3 class="header">Volunteering</h3>
<div>
<p><b>(January 2024 - Present) Statistician - <a href="https://pvrinstitute.org/">Pulmonary Vascular
Research Institute</a></b></p>
<p>
Volunteering in association with the Royal Statistical Society as the primary statistician
analysing data from a large-scale patient survey.
<ul>
<li>Working with academics from the University of Cambridge and third-sector organisations to
publish papers in high-impact journals (manuscripts in progress).</li>
</ul>
</p>
<p><b>(August 2024 - September 2024) AI Safety Engineer - <a
href="https://www.arcadiaimpact.org/">Arcadia
Impact</a>, <a href="https://www.aisi.gov.uk/">AISI</a></b>
</p>
<p>
<ul>
<li>Implementing a multimodal, zero- and multi-shot benchmark (<a
href="https://mathvista.github.io/">MathVista</a>) to AISI's
<a href="https://inspect.ai-safety-institute.org.uk/">Inspect</a> framework and evaluating
the performance against existing and novel models (OpenAI's GPT-4 Turbo, GPT-4o, and GPT-4o
mini) on the benchmark. <a
href="https://github.com/UKGovernmentBEIS/inspect_evals/tree/main/src/inspect_evals/mathvista">Link
to implementation</a> and <a href="https://github.com/UKGovernmentBEIS/inspect_ai/pull/322">PR.</a>
</li>
</ul>
</p>
</div>
</subsection>
</section>
<!-- Education Section -->
<section id="Education">
<h2 class="header">Education</h2>
<div>
<p><b>(2018 - 2022) University of Oxford, Integrated Master's in Mathematics (MMath).</b></p>
<p>Master's thesis on the statistical behaviour of protein folding using data from computational models.
</p>
<p>Areas of focus:</p>
<ul>
<li>Statistics</li>
<li>Machine learning and deep learning</li>
<li>Numerical methods</li>
<li>Computational biology</li>
</ul>
<p><b>(2016 - 2018) King's College London Mathematics School.</b></p>
<p><b>A-Levels:</b> Mathematics (A*), Further Mathematics (A*), Physics (A)</p>
<p><b>AS-Levels:</b> Computer Science - Python (A), Further Additional Mathematics (A)</p>
</div>
</section>
<!-- Expertise Section -->
<section id="Expertise">
<h2 class="header">Expertise</h2>
<div>
<p><b>Python:</b> 5+ years of experience across professional, personal, and academic settings. Use of
standards such as PEP8, type hinting, and environment management. Selected libraries:</p>
<ul>
<li>Data analysis and processing: Pandas, NumPy, SciPy, Polars, PySpark</li>
<li>Machine Learning: Scikit-Learn, TensorFlow, LangChain, PyTorch, Keras</li>
<li>Visualisations: Plotly, Matplotlib</li>
</ul>
<p><b>Cloud:</b> Set up end-to-end data pipelines from scratch and managed a migration from AWS to Azure
</p>
<ul>
<li><b>AWS:</b> Batch, Lambda, S3, EC2, ECR, IAM</li>
<li><b>Azure:</b> Batch, Blob storage, Container Registry, IAM</li>
<li><b>GCP:</b> BigQuery, Looker</li>
</ul>
<p><b>General software engineering:</b> Working on large code bases, unit and end-to-end testing,
version control & CI/CD (git), semantic versioning, Linux (Debian-based/Ubuntu - desktop
& server), and Docker.</p>
<p><b>General data science:</b> SQL, data processing (online, batch, and unified with Apache Beam),
particular aptitude for quantitative analysis of time-series and sensor data, and building robust
and explainable machine learning models.</p>
</div>
</section>
<!-- Publications -->
<section id="Publications">
<h2 class="header">Publications</h2>
<div>
<ul>
<li>J. Newman, <b>S. Munagala</b>, M. Fay, G. Fischer, M. Granato, L. Howard, M. Kurzyna, L.
Macdonald, G. Meszaros, E. Otter, M. Stone, K. Bunclark, M. Toshner, M. Tschida, PVRI IDDI
Patient Engagement & Empowerment Workstream, PH GPS Consortium, J. Pepke-Zaba. 2024.
<i>Pulmonary Hypertension Global Patient Survey: a preliminary overview.</i> [<a
href="resources/ERS 2024 Poster - PH GPS.pdf">Poster</a>]. European
Respiratory Society Congress 2024, 7 September - 11 September. Vienna, Austria.
<ul>
<li>Winner of European Respiratory Society & European Lung Foundation Travel Grant for Best
Abstract in Patient-Centered Research</li>
</ul>
</li>
<li>J. Newman, <b>S. Munagala</b>, M. Granato, M. Kurzyna, L. MacDonald, G. Meszaros, E. Otter, M.
Stone, M. Toshner, M. Tschida, J. Pepke-Zaba. 2024. <i>Pulmonary Hypertension Global Patient
Survey: a preliminary overview.</i> [<a
href="resources/WSPH24 Poster - PH GPS.pdf">Poster</a>]. 7th World
Symposium On Pulmonary Hypertension, 29 June - 1 July. Barcelona, Spain.</li>
<li><b>S. Munagala</b> (inventor), T. Routledge (inventor), BIOStress Lab Ltd. (applicant)
<i>Measurement of Physical Stress Response.</i> [<a
href="https://www.ipo.gov.uk/p-ipsum/Case/ApplicationNumber/GB2402167.7">Pending Patent</a>]
(Patent Application Number GB2402167.7) Patents Journal Number 7037, UK Intellectual Property
Office, Lodged: 16 February 2024.
</li>
</ul>
</div>
</section>
</div>
<br>
</body>
</html>