Project: Realtime AI Financial Risk Application Migration & Development (Telco) | Role: Lead Data Scientist
- Big Data Engineering: Developed ETL pipelines handling massive datasets, event modelling.
- Development of new functionalities (analytics modules, user feedback and model training processes) for internal analytics platform.
- Development of SQL and PL/SQL load runs based on specific requirements and storage of the results in relational SQL databases.
- Anomaly Detection: Built models for time-series anomaly detection and automated root cause analysis, including explainable AI (SHAP)
- Operational Excellence: Created monitoring dashboards to ensure 24/7 system reliability.
Environment: SQL and PL/SQL (Oracle), Apache Spark, Hadoop, Apache Hive, Python and Kubernetes/Docker, Azure, AWS; Keras, scikit-learn/sklearn, MLflow; Jira and Confluence, Grafana, Prometheus, Tableau, Databricks, Unity Catalog.
Project: Speech Recognition Bot (Tech Startup) | Role: Data Scientist
- Implemented automatic speech recognition (Kaldi, Shell) and Named Entity Recognition (NER) for a conversational AI interface.
- Containerized software operationalization.
- Managed stakeholder expectations and defined the technical roadmap.
Project: Address Classification (Logistics) | Role: Assisting Data Scientist
- Implemented ETL pipeline using Kedro + PyTorch
- Inference API operational via FastAPI and Docker
- Data analysis using Matplotlib, Jupyter Notebooks, Python, on Azure
- System stress testing with locust
- Operationalizing LLM