Biography

I am a Senior AI Engineer at Balbix. I work on modeling data for cybersecurity, specifically in the field of cyber-risk quantification.

I am a former Master’s student from the Department of Biomedical Engineering at Columbia University and a former engineering student (= Dual BS/MS) in Computer Science and Data Science from Télécom Paris (France’s #1-ranked school in computer science).

I have always been highly interested in mathematics and computer science. I learnt computer languages (Java, C & Python) and have taken numerous courses in machine learning.

During my Master’s at Columbia University, I extended my skills and knowledge in data science to apply them for research in biomedical engineering (genomics, biomedical imagining and neuroscience). I had the opportunity of pursuing two consecutive internships (in a Co-op format) at Balbix (Bay Area) as an AI/ML Intern where I applied those skills for cybersecurity.

My goals? To use my skills in mathematical modeling and machine learning to better understand and address societal problems. I aim to have a positive impact on people’s lives and society.

Feel free to connect with me to chat about science and/or if you are interested in my profile!

Download my résumé (updated on September 20th 2024)

Education
  • Master of Science in the Department of Biomedical Engineering, 2023

    Columbia Univeristy

  • Bachelor of Science & Master of Science in Computer Science, 2023

    Télécom Paris

  • Master of Science in Theoretical Data Science & Computer Science, 2022

    EURECOM | Dual Program with Télécom Paris

  • Preparatory classes, 2020

    Lycée Michelet

Skills

python
Python
java
Java
git
Git
airflow
Airflow
kubernetes
Kubernetes
docker
Docker
aws
AWS
elasticsearch
Elasticsearch
deltalake
Delta Lake
pytorch
PyTorch
sklearn
Scikit-Learn
spark
Spark
duckdb
DuckDB
pandas
Pandas
scipy
Scipy

Experience

 
 
 
 
 
Balbix
Senior AI Engineer
Jan 2024 – Present New York, United States (Remote)
• Led the rolling of an Airflow DAG using Apache Spark and involving Delta Lake and Elasticsearch databases to improve Balbix’s current data pipeline responsible for the product UI’s proper functioning.
• Created a statistical framework for normalizing distributions to provide clients with readable insights of their cyber-risk.
• Optimize Balbix’s cloud infrastructure to reduce cost of Apache Spark tasks by 50%.
• Conducted research and development within the cyber risk quantification team presenting research papers and modeling cybersecurity problems.
• Developed a new feature using an improved Shapley values-based framework to provide clients’ with their next best steps for them to prioritize their IT teams’ work and improve their cyber-risk posture.
 
 
 
 
 
Balbix
AI/ML Co-Op
May 2023 – Dec 2023 San Francisco Bay Area, United States
During the Summer 2023, I perfomed an internship at Balbix within the AI/ML Team to help them improve their modelization of risk’s over a network and develop a new scoring metric for prioritizing vulnerabilities' resolution.

I am continuing my work at Balbix during the Fall 2023 semester as a part-time intern while finishing my studies at Columbia University. I am working on improving Balbix’s CRQ model using new datasets and state-of-the-art modeling techniques following my Summer 2023 work.

Tasks:
• Architecting machine learning models using graph theory and MITRE’s CAPEC dataset to enhance Balbix’s Cyber Risk Quantification (CRQ) platform, which quantifies cyber risk exposure and potential financial impact in business terms ($, €, £, ¥).
• Pioneered a novel cyber risk scoring methodology using Shapley values to help clients intelligently prioritize 1000s of vulnerabilities' mitigations based on potential financial impact. The approach considers each vulnerability’s marginal contribution to overall network risk, providing more nuanced insights than traditional CVSS scores.
• Optimized scoring algorithm runtime by 500%, compared to base implementation, by parallelizing computations in Apache Spark using PandasUDF, enabling hourly updates of risk scoring of millions of devices and vulnerabilities.
 
 
 
 
 
Columbia University in the City of New York
Teaching Assistant
Sep 2022 – Dec 2023 New York, United States
Teaching Assistant for the Department of Mathematics at Columbia University for Calculus II (Fall 2022 & Spring 2023), for Linear Algebra & Probabilities (Summer 2023) and for Calculus I (Fall 2023).

Tasks:
• Evaluated and overlooked assignments, projects and exams performance of over 350 students.
• Enhanced students learning by providing individualized assistance to follow lesson plan.
• Top 9 finalist of the University-wide ‘2023 Presidential Awards for Outstanding Teaching by a Graduate Student Instructor’.
 
 
 
 
 
Electrophysiology, Memory, and Navigation Laboratory
Research Assistant
Sep 2022 – Jan 2023 New York, United States
Research Assistant in Jacobs lab: Electrophysiology, Memory, and Navigation Laboratory (https://jacobslab.bme.columbia.edu/)

Worked on a research project to develop classification machine learning models (using Linear Classifier, SVM, and MLP) to predict visual stimulus using electrocorticography neuronal signals with statistical significance.
 
 
 
 
 
Telecom Etude
Machine Learning Consultant
Jun 2022 – Jul 2022 Palaiseau, France
• Developed for a startup machine learning models using Linear Regression, SVM, or XGBoost to predict time-series of daily attendance levels over a week of restaurants in Paris.
• Automated the collection and the pre-processing of open data from 6 different sources (weather, road traffic, events, etc.) depending on the location of the restaurants.
• Released a statistical method using Meta Prophet framework to predict attendance levels with 95% confidence intervals.
• Compiled a report to explain the functioning of my code and the impact of different parameters.
 
 
 
 
 
Telecom Business & Finance
Secretary-General
Jul 2021 – Jul 2022 Palaiseau, France
• Organized conferences with prestigious alumni (Paul-François Fournier, Michel Combes, Fred Potter, Jean Schmitt).
• Expanded corporate relations (partnerships with PwC, Strategy& and AlumnEye).
• Handled the communication of the association (Facebook, Instagram, LinkedIn and the website).
• Managed all the association’s administrative procedures and archives.
• Realized a conference on personal finance to introduce this subject to Télécom Paris' engineering students and explain them why they should invest as soon as possible, where they could start investing in and how they could develop their portfolio in the future
• Participated, as Secretary-General, in the definition of the association’s strategy to make it grow.
 
 
 
 
 
Dream Team Des Etudiants
Private teacher
Mar 2022 – Jun 2022 Alpes-Maritimes, France
• Tutored 4 students from 6th grade to 12th grade in mathematics, physics, French, and chemistry for preparation to French examinations.
• Helped students with post-baccalaureate education counseling.
 
 
 
 
 
Forum Télécom Paris
Secretary-General
Oct 2020 – Jan 2022 Palaiseau, France
Managed a team of 40 people to organize Télécom Paris' 2021 career fair
Results : more than 500 students, 78 companies, €200k turnover, company satisfaction: 4.1/5 and student satisfaction: 4.4/5.
Handled:
• corporate relations (prospecting, customer service, etc.);
• logistics (choice of furniture, organization before the event, etc.);
• communication (administrator of Facebook & Instagram pages);
• the website (development of a digital business card solution via a QR Code system, CV library, interactive map, etc.);
• the organization of the association’s events (association campaigns, student parties, distribution of €55k for student projects, etc.).
As an elected member of the association’s board to represent both students' and school’s interests, I was always in discussion with the administration and corporate relations of Télécom Paris. I was also responsible for major decisions concerning the association.
 
 
 
 
 
Sopra Steria
Software Engineer Intern
Jul 2021 – Jul 2021 Paris, France
Joined a team at Sopra Banking Software as part of the Undergraduate Program set up by Sopra Steria.

Tasks:
• Automated their internal documentation website (Python & GitLab) for referencing solutions deployed for their customers;
• Automated verification of bank security certificates from providers (Python & AWS Lambda) to send the necessary information to the Support teams.

I also learned more about the group by discovering all the verticals and subsidiaries of Sopra Steria. I was able to spend a full day with the HR manager and member of the Comex of Sopra Steria Group: Jean-Charles Tarlier.

Studies

Master of Science in Biomedial Engineering
I am pursuing my academic curriculum within the Department of Biomedical Engineering at Columbia University.
I am taking courses in which I am able to apply my Computer Science and Data Science skills for Biomedical Engineering research (genomics, biomedical imagining and neuroscience).

Coursework:
DROM 9120 (PhD) Dynamic Programming and Optimal Control,
APMAE 4990 Mathematics of Data Science,
• BMENE 4460 Deep Learning in Biomedical Imaging,
• BMENE 4480 Statistical Machine Learning for Genomics,
• BMENE 4110 Biostatistics for Engineers,
• ECBME 4060 Introduction and Data Science for Genomics,
• BMEBW 4020 Computational Neuroscience,
• EEBME 6091 Topics in Computational Neuroscience,
• BMENE 6003 Computational Modeling of Physiological Systems
Master of Science in Theoretical Data Science & Computer Science | Dual Program with Télécom Paris
EURECOM is a prestigious Research Center of Telecom Paris in digital science with recognized academic teams (TUM, PoliTo, IMT, Aalto, etc.) and industrial partners (BMW, Orange, Norton, etc.).

Coursework (given in English):
• MALIS | Machine Learning and Intelligent System,
• DL | Deep Learning,
• MALCOM | Machine Learning for Communication Systems,
• AML | Algorithmic Machine Learning,
• ASI | Advanced Statistical Inference,
• Optim | Optimization Theory with Applications,
• Clouds | Distributed Systems and Cloud Computing,
• DBSys | Database Management System Implementation,
• QUANTIS | Quantum Information Science,
• WebInt | Interaction Design and Development of Modern Web Applications,
• ManagIntro & Business Simulation | MBA classes of introduction to management and business,
• ProjMan | Project management,
• TeamLead | Personal Development and Team Leadership,
• General introduction to law: contracts, setting up a business,
• AwaRe | Awareness-raising to research.
Bachelor of Science & Master of Science in Computer Science
A highly selective French Engineering School, Télécom Paris is considered to be the leading French school in Computer Science. 150 students admitted for over 15 000 applicants.

The Telecom Paris Dual Bachelor of Science and Master of Science program prepares engineers in the fields of Computer Science, Data Science, Digital Economics and Telecommunications.

Coursework:
• Computer Science: Java Programming, Information Theory, Formal Languages, Operating Systems, C Programming, Networks,
• Applied and Advanced Mathematics: Probability & Statistics, Linear Algebra, Analysis, Signal Processing and Graph Theory,
• Physics: Optics and Photonics, Antenna and Propagation, Micro and Nano-Physics,
• Electronics: Acquisition Systems, Processors Theory,
• Economics & Humanities: Introduction to Economics, Management Science, Ethics, Geopolitics, Political Science, Writing an essay on respect and freedom of speech.
Preparatory classes • Intensive courses in Mathematics, Physics, Chemistry and Engineering Sciences
Two-year undergraduate to get prepared to national competitive examinations for admission to the French “Grandes Ecoles”.
I chose a specialisation in Physics & Chemistry in January 2019 and entered in a high-level class (called star classes) for the scholastic year 2019/2020.
Global outline:
• 2100h of intensive classes, tutorials & labs (600h of mathematics, 600h of physics, 360h of chemistry, 160h of philosophy, 150h of computer science & 120h of English);
• 60 4h-written-examinations (one to two per week);
• 2 1h-oral-examinations and 2 assignments (in mathematics and physics) each week.

Projects

*
Machine Learning Prediction of CITE-seq Protein Expression from scRNA-seq Data
Our project, based on a Kaggle competition, was to work on the prediction of cell surface protein expression (CITE-seq) from single-cell RNA expression data (scRNA-seq).

Contact