M. Tarik Altuncu, PhD — Founding AI Engineer

At a glance

Founding AI Engineer at Amble (Page Nineteen, Ltd.), London, since May 2025. Amble is a voice-first, personalised language acquisition product backed by South Park Commons.
Previously Senior Data Scientist at Simpplr Inc., San Francisco (March 2022 - April 2024). Built LLM, retrieval, and recommendation systems for a multi-tenant SaaS used by 700K+ people.
Before Simpplr, Data and Analytics Manager and ML Team Lead at TRT (2020 - 2022). Ran a 17-person team across Growth Data Science, Data Engineering, and ML for a public broadcaster's OTT media platform.
Co-Founder at ministo.app (May 2024 - April 2025). Personalised children's storytelling app from the Founders Inc. Winter Cold Start accelerator in San Francisco.
PhD, Imperial College London (2016 - 2021). Graph machine learning applied to NLP. 3 peer-reviewed papers and 4 conference presentations.

Experience

Founding AI Engineer — Amble (Page Nineteen, Ltd.)

May 2025 — Present

London

Joined at a small-team stage in May 2025 and have shipped through several iterations of the platform. The current 6-person engineering team (3 ML/AI engineers, 1 iOS, 1 designer, 1 founder/PM) came together over the second half of 2025.

Personalised immersive language acquisition platform: multi-persona voice agent tutors, AI-generated articles, FSRS-based vocabulary scheduling.

Workstreams I own end-to-end

Multi-provider STT and audio-LLM evaluation harness. 30 model/method combinations across OpenAI, Gemini, Mistral, and ElevenLabs. 1,500 API calls on real production audio. Consensus-based WER with 95% CIs and p95 latencies, plus Pareto-frontier analysis. Found Mistral Voxtral Mini STT outperforms GPT-4o-transcribe on learner speech at lower cost; the harness doubles as a regression test against vendor model updates.
TTS provider selection. ElevenLabs Multilingual v2 for conversational tutor personas; Cartesia Sonic 3 for article read-out, where reliable streaming word-level timestamps in non-English languages outweighed raw quality.
Three-layer voice context injection. Gives the stateless real-time tutor working memory of every prior session. Mem0 with event-anchored timestamps and expiration-aware future facts. Each layer degrades gracefully when an upstream service is unreachable.
Personalised push notifications. Body copy generated per user by Claude Sonnet from the recommendation feed, timezone-aware APNs delivery, and a content-freshness check that drops the push if the user already saw the content in-app. Runs as a 7-state lifecycle across 4 services and 2 Redis queues. 5.3x lift in notification-driven opens within the first week of full rollout.
On-demand article streaming with word-level audio alignment. Built jointly with another backend engineer. SSE from partial-JSON LLM output; sentence-buffered TTS with shifted timestamps yields a contiguous read-along timeline. Generate-once-per-shared-article semantics via a Redis claim.
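
The shifted-timestamp idea in the article-streaming workstream can be illustrated with a minimal sketch. It assumes each sentence is synthesised as its own audio clip and that the TTS provider returns word timings relative to the start of that clip. The names `Word`, `SentenceAudio`, and `merge_timeline` are hypothetical and chosen for illustration only; this is not the production code.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds


@dataclass(frozen=True)
class SentenceAudio:
    words: List[Word]  # timings relative to the start of this clip (assumption)
    duration: float    # clip length in seconds


def merge_timeline(sentences: List[SentenceAudio]) -> List[Word]:
    """Shift each clip's word timings by the total duration of earlier clips."""
    timeline: List[Word] = []
    offset = 0.0
    for sentence in sentences:
        for word in sentence.words:
            timeline.append(Word(word.text, word.start + offset, word.end + offset))
        # Advance by the full clip duration so trailing silence is preserved.
        offset += sentence.duration
    return timeline


if __name__ == "__main__":
    clips = [
        SentenceAudio([Word("Hola", 0.0, 0.5), Word("mundo", 0.5, 1.0)], duration=1.0),
        SentenceAudio([Word("Adios", 0.25, 0.75)], duration=1.5),
    ]
    for w in merge_timeline(clips):
        print(f"{w.start:.2f}-{w.end:.2f} {w.text}")
```

Advancing the offset by clip duration rather than by the last word's end time keeps the merged timeline aligned with the concatenated audio even when a clip ends in silence.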

Supported alongside the team (primarily owned by the other backend/AI engineers): the Pipecat + LiveKit + Modal voice cascade (sub-second turn latency) and the hybrid recommendation engine with two-layer diversity enforcement that addresses LLM topic convergence (the "coffee problem").

Stack: Python, FastAPI, Pipecat, LiveKit, Modal, PostgreSQL, Redis, Mem0, Latitude, OpenAI Realtime, Claude, ElevenLabs, Cartesia, APNs.

Senior Data Scientist — Simpplr Inc.

Mar 2022 — Apr 2024

San Francisco

Highlights of two years on the AI team:

Retrieval-Augmented Generation system for in-product knowledge search, one of the early enterprise RAG deployments. Hybrid retrieval over Milvus and Elasticsearch, a qualification gate for generated answers, and tenant-isolated prompts. Recognised as one of the differentiators in the product's path to the Gartner leadership quadrant.
AI-powered virtual assistant chatbot from POC to production. Presented strategy options directly to the CTO.
Multi-tenant collaborative-filtering recommender on Snowflake, Airflow, MLflow, Redis, and Kubernetes. Over 100K daily recommendations served.
Fine-tuned Microsoft Phi for personalised content relevance classification at batch-inference scale. 79% drop in false positives.
Customer churn model with LightGBM on Snowflake data. 87% recall at 60-day lead time; ~$5M ARR in potential savings.

Assistant Professor — Biruni University, Computer Engineering

2023 — 2024 (currently on unpaid leave)

Designed and taught two new courses for the Computer Engineering programme: an Artificial Intelligence course and a Generative AI course (cmp405genAI/cmp405). Currently on unpaid leave to focus on Amble.

Co-Founder — ministo.app

May 2024 — Apr 2025

Founders Inc. Winter Cold Start accelerator (San Francisco). Designed a multi-modal AI pipeline combining LLMs, image generation, and audio synthesis for a personalised children's storytelling app. Soft-launched in Middle Eastern markets.

Selected past consulting

2024 — 2025

Global Discovery RAG news system (TRT). LangChain and LangGraph pipeline for a global newsroom with editorial guardrailing and an intent-classification evaluation framework.
MediSummarize clinical documentation assistant. Unsloth-based LLM fine-tuning on medical datasets for privacy-preserving clinical-note summarisation. 63% reduction in physician documentation time.

Data and Analytics Manager — TRT

Oct 2021 — Apr 2022

Set the data and analytics direction for a public broadcaster's OTT media organisation, leading a 17-person team across Growth Data Science, Data Engineering, and Machine Learning. The platform reached 5M+ video, 600K+ audio, and 2M+ mobile-game users during this period.

Earlier

ML Team Lead at TRT (2021; 7 engineers across NLP, vision, and personalisation; MLOps on DVC, MLflow, and Airflow). Senior Data Scientist at TRT (2020 - 2021; real-time pipeline on 6M+ daily tweets; transformer-based sentiment analysis for TV drama). Data Scientist at TRT World (2016). Social CRM lead at Adba International (2014 - 2016; Salesforce, Brandwatch, and Clarabridge partnerships for Turkish Airlines and others).

Education

PhD, Applied Mathematics. Dissertation: Graph-Based Topic Extraction from Vector Embeddings of Text Documents. Three first-author peer-reviewed papers; conference presentations including NetSci, NetMob, CCS, and KDD.

Imperial College London

2016 — 2021

MSc, Finance

Sabanci University

2014 — 2015

BSc, EEE

Bogazici University

2009 — 2014