Data Science

Back to page

Methods of generating values from the data:

Predictive modelling

We have most experience in predictive modelling. We have also carried out our biggest projects so far in this particular field. They involved:

  1. Customer attrition modelling (Churn) - finance industry (data volume (around several millions observations) - ongoing project (since second quarter of 2017)
  2. Default probability for Structural Credit Risk - finance industry (around several millions observations) - this project took place in the first half of 2017
  3. Bookmaker risk analysis (around several millions observations) - second half of 2017

During the projects, we applied different algorithms, such as: random forests, neural networks, logistic regression, Stochastic Gradient Boosting and Support Vector Machines, among others.


We have run the clustering analysis mostly in the retail trade and construction industry (two separate projects at the turn of 2017 and 2018). We've looked for the optimal division on uniform groups of clients, products and vendors.

We usually used K-means and K-medoids algorithms, as well as hierarchical clustering. The analyses have benn carried out on data sets of

up to couple thousand observations.

Asociation analysis

We have carried out three such projects in the retail business.The scope of analysis involved data set up to couple thousand observations. One project took place at the start of 2017, and the other two other at the turn of 2017 and 2018.

Simulation modelling

We made a model that ensured the optimization of processes and costs in face of uncertainty. It was a job for manufacturing company, which had a limited knowledge on future values of the parameters.

In order optimize the operations, it was necessary to run estimations on the basis of multiple simulations. The data set involved several millions observations. This project took place in the second half of 2017.

Text mining

The text mining project involved supplying the data to unfinished sets and correcting incorrect strings on the basis of an available dictionary of possible values.

The project has been carried out in the pharmaceutical industry in the second half of 2016, and the data sets involved up to several dozen thousands observations.

Contact us!
Mobile Reality Sp. z o.o.
Al. Ks. J. Poniatowskiego 1
03-901 Warsaw, Poland
TAX ID: 701-055-92-96
US, Washington
1440 G St. NW
Washington D.C.