Jakub Švehla

Data Scientist & ML engineer
Prague, Czech Republic
|

linkedin.com/in/jakubsvehla
|

jakubsvehla.me
|

jakub.svehla@gmail.com
|

+420603831593
Data scientist and ML engineer with 10+ years experience. Built an LLM-based agent system to automate customer support tasks and ML-based apps from a proof of concept to production applications. Skilled in Python, SQL, ML, and software engineering.
J
Š

Work Experience

Freelancing

Senior Data Scientist
|

Jan 2020 - Current
  • Developed an LLM-based agent system to help with customer support tasks, such as ticket tracking, knowledge base extraction, or drafting replies.

BridgeFund

Senior Data Scientist (Contractor)
|

May 2020 - Current
  • Developed a credit risk model and bank transaction categorization from a proof of concept to a production system.
  • Built an LLM-based bank transaction anonymization pipeline.
  • Developed a historized ML feature store on top of Databricks and Apache Spark.
  • Optimized marketing budget allocation with Marketing mix modeling.

FlowCutter

Senior Data Scientist (Contractor)
|

Feb 2021 - Nov 2022
  • Worked on detecting anomalies in network traffic using ML.

Datamole

Data Scientist and Team Lead
|

Apr 2015 - Dec 2019
  • Communicated with clients to identify and understand their problems and help them improve their processes and efficiency with data and AI.
  • Led a data science team and helped hire and supervise new junior data scientists.
  • Worked hands-on on many data science projects, primarily for large international bio- and agro-tech companies.
  • Implemented many machine learning proof of concepts written in Python and Jupyter notebooks, using tools such as pandas, scikit-learn, SQL, Apache Spark, and various non-SQL databases.
  • Deployed ETL and ML pipelines to production in the cloud.

CTU FIT

External Lecturer
|

Sep 2017 - Dec 2019
  • Helped prepare and lectured a graduate course on distributed data mining (both theoretical background and implementation in Apache Spark).
  • Helped prepare and lectured a Python course for beginners (undergraduate).

Wikidi

Data Scientist
|

Jan 2014 - Apr 2014
  • Worked on algorithmic/quantitative trading using machine learning.
  • Applied machine learning algorithms to financial data.

Socialbakers

Data Engineer
|

Apr 2012 - Feb 2014
  • Worked on the backend that collected data from social networks (Facebook, Twitter, Instagram), stored, and analyzed them.
  • Worked with various databases such as MySQL/PostgreSQL, MongoDB, HBase, or Redis.

Education

Faculty of Information Technology CTU in Prague

Data Science, Master's degree
|

Sep 2014 - Jun 2018

The University of Texas at El Paso

Data Science
|

2016 - 2017

Czech Technical University in Prague, Faculty of Information Technology

Bachelor's degree, Computer Science
|

2011 - 2014

Projects

Simple Portfolio

Jan 2020 - Current

Simple Portfolio is a portfolio tracking SaaS with 10,000+ users that I run as a side project.

Scaffold

Scaffold is a full-stack web framework that makes it easy to build apps following the Domain-Driven Design (DDD) approach and layered/onion architecture.

Active semi-supervised clustering algorithms for scikit-learn

An open-source library implementing a wide range of active semi-supervised clustering algorithms for scikit-learn.

Skills

  • Machine learning
  • Data science
  • LLMs
  • Python
  • SQL
  • Apache Spark
  • Data warehousing
  • ELT
  • Docker
  • DDD

Languages

English
Fluent
Czech
Fluent