Speaker
Thi Phuong Kieu Nguyen
Summary
The presentation introduces the engineering approach to a Big Data project. This approach consists of four main steps: acquisition, organization, analysis, and restitution (reporting of results). The evolution of open source software for distributed processing of large data sets plays an essential role in Big Data projects. Classical machine learning and data mining methods are then essential for handling very large data sets in distributed mode. Finally, some use cases of data science projects are presented.