Renginiai

Apache Spark as a framework for advanced prediction methods on to BigData

Tema:

Apache Spark as a framework for advanced prediction methods on to BigData (Data science seminar).

Speaker:

Josef A. Habdank,
Lead Data Scientist and Data Platform Architect at Infare Solutions, Denmark
 
Abstract:
 
Developing and deploying a scalable prediction platform is a very challenging task that many big data practitioners are
struggling with. The holy grail of data science/prediction infrastructure is to train the prediction models in real time as
the data is collected and streamed into the data center, and serving the prediction results in an on-demand fashion via a
service.
 
In this talk the speaker will go through a set of online machine learning tools that if used appropriately can be scaled to
work on truly massive datasets with of billions or tens of billions of rows flowing through the system daily.
The talk will cover dimentionality reduction, clustering and prediction using both simple tools such as linear regression
as well as more advanced tools such as Markov Chains.

At Infare Solutions this framework is developed to be used on a massively multivariate time series, collecting a
billion+ new observations daily, as support tool for the airline Revenue Management systems.
 
Renginio data ir laikas:
 
2017 gegužės 4d., 19 val.
 
Renginio vieta:
 
102 a., Naugarduko g. 24, VU MIF