Hi ๐, Iโm Stephan!
What do I do? I solve problems using data! I specialize in improving process quality in manufacturing and tackling challenges across various fields, including social media, video games, and e-commerce.
Currently, I work as a data scientist in manufacturing and quality assurance. My role involves:
- Developing business intelligence solutions (e.g., KPI dashboards) for internal and external stakeholders
- Building machine learning applications and optimisation models for monitoring and quality assurance using web-based platforms.
I earned my PhD in Computer Science from the University of Melbourne, where I gained extensive experience in data analysis and data profiling as part of the Melbourne eResearch Group.
Teaching and sharing knowledge is another passion of mine. I teach machine learning and love simplifying complex data through clear, impactful visualizations. Teaching not only helps me share knowledge but also deepens my understanding. As Einstein said: โIf you canโt explain it simply, you donโt understand it well enough.โ
To stay sharp, I regularly explore diverse datasets, read blogs or reddit and refine my expertise. Whether itโs improving processes, solving data challenges, or visualizing insights, I am always excited to tackle the next problem.
- ๐ฅ๏ธ Languages: Proficient in R, Python (including seaborn, shap, matplotlib, and other typical libraries), SQL (MS SQL Server, PostgreSQL, MySQL), and Bash.
- ๐ ๏ธ Version Control: Expertise with GitHub, GitLab, and MS DevOps (pipelines and workflows).
- ๐ Data Visualization: Experienced with Power BI, Qlik Sense, Grafana, and libraries like ggplot2, seaborn, and matplotlib.
- โ๏ธ Cloud Platforms: Skilled in Azure, GCP, and AWS.
- ๐ Data Pipelines: Proficient with Apache Airflow, pipeline automation via MS DevOps and GitHub.
- ๐ APIs: Extensive experience with RESTful APIs (import/export).
- ๐ ๏ธ Big Data: Knowledge of Hadoop and data engineering frameworks.
- ๐ Machine Learning: Comprehensive expertise from regression to advanced techniques:
- Supervised & Unsupervised Learning
- Deep Learning with TensorFlow and PyTorch
- Grid Hyperparameter Tuning, SHAP for interpretability
- Explorative & Predictive Modeling
- ๐ง Large Language Models (LLMs): Hands-on experience with Ollama, HuggingFace Transformers for text generation, text-to-image, image-to-text, and voice-to-text.
- ๐ Time Series Analysis: Extensive experience with forecasting tools like Prophet, change-point detection, and advanced techniques.
- ๐ข Optimization: Expertise in linear programming with lpsolveapi for applications like resource optimization, cost reduction, and sample selection.
- ๐ Predictive Maintenance: Exploring sound and ultrasound analysis (e.g., Frugatto).
- โ
Quality Assurance: Significant experience with:
- Outlier Detection, Completeness Checks
- Trustworthiness Evaluation, Fraud Detection
- Duplicate Identification and mitigation
- ๐ Reporting: Skilled in JAMstack, shiny apps, markdown, and dockerized environments.
- ๐ Model Deployment: Experience with Flask and serving machine learning models in production.
- ๐๏ธ Vision Applications: Expertise with OpenCV for various computer vision tasks.
- ๐ Open Source Contributions: Active on GitHub, TidyTuesday, and StackOverflow.
- ๐ฉโ๐ซ Teaching and Mentorship: Mentored PhD students and contributed to skill development.
- ๐ฌ Passionate about exploratory data mining, data visualization, and gamification.
- ๐ฎ Interested in board games, video games, and scrapping data about them on a lot of platforms.
- ๐ง Enjoys badminton, climbing, and aquaponics.