Taruen

Taruen is a custom language technology and data studio based in Istanbul.

About the Founder

Taruen is the independent practice of Ilnar Salimzianov, a computational linguist and MLOps engineer based in Istanbul. Holding an M.Sc. in Computational Linguistics from the University of Stuttgart, Ilnar has architected scalable language technology and ML-ready datasets for organizations ranging from the Mozilla Data Collective to US-based legal tech startups.

Taruen operates on a digital craftsman ethos: building bespoke, robust data infrastructure where no technical problem is solved twice.

Read more about Ilnar's academic and open-source work →

Edge Services

We deploy lightweight, low-latency applications at the network edge via edge.taruen.com.

Open Datasets

We are dedicated to the digital preservation and advancement of regional languages and open data. Our machine-learning-ready datasets are hosted on the Mozilla Data Collective: