LMBP-ISM joint workshop

Europe/Paris
Amphi Hennequin (Laboratoire de Mathématiques Blaise Pascal )

Amphi Hennequin

Laboratoire de Mathématiques Blaise Pascal

3, place Vasarely 63178 Aubière cedex
Tomoko Matsui (The Institute of Statistical Mathematics (ISM)), Nourddine Azzaoui (Laboratoire de Mathématique Blaise Pascal - UCA)
Description

Statistical modeling, machine Learning, and artificial intelligence insights  for natural hazard and risk studies

 

The focus of this workshop is on risk management, emphasizing natural disasters such as earthquakes and volcanoes. These challenges are complex, and it is crucial for experts from different fields to collaborate to develop innovative solutions. By combining experts in statistical modeling, machine learning, and artificial intelligence, we aim to develop advanced statistical models expected to identify, measure, and mitigate risks accurately and efficiently. In fact, with the use of machine learning and artificial intelligence technologies, it is now possible to identify hidden patterns and relationships in data that are not discernible through human analysis. As a result, we anticipate the development of better prediction models, enabling the forecasting of potential risks and the implementation of preventive and mitigation measures in advance.

The objectives of this workshop are as follows :

  1. Share knowledge and experience regarding the use of machine learning techniques for risk management in various applications.
  2. Create opportunities for networking and collaboration between young and senior researchers and practitioners in the field of natural disaster risk management from both Japan and France.
  3. Identify innovative solutions and strategies that can more effectively manage risk using statistical modeling, machine learning, and artificial intelligence.
  4. Explore potential opportunities for new international joint research between the Institute of Statistical Mathematics, The Laboratoire de Mathématiques Blaise Pascal, and the International Research Institute of Disaster Science IRIDeS from Tohoku University, while broadening the presence of ROIS both domestically and internationally.
  • Tuesday, 24 October
    • Opening Remarks and Reception Maison de la Vie Etudiante

      Maison de la Vie Etudiante

      Just in front of the Amphi Hennequin

      Opening of the workshop and words from le LMBP director.

      • 1
        Opening by the LMBP director/ Prof. Julien Bichon
        Speaker: Prof. Julien Bichon (Laboratoire de Mathémariques Blaise Pascal (LMBP))
    • Session: Statistical Modeling Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. François Septier (Université Bretagne Sud (UBS))
      • 2
        Generalised hyperbolic state space models with application to spatio-temporal heat wave prediction

        As global warming progresses, it is increasingly important to monitor and analyse spatio-temporal patterns of heat waves and other extreme climate-related events that impact urban areas. In this work, we present a novel dynamic spatio-temporal model by combining a state space model (SSM) and a generalised hyperbolic distribution to flexibly describe a spatial-temporal profile of the tail behaviour, skewness and kurtosis of the local urban temperature distribution of the greater Tokyo metropolitan area. Such a model can be used to study local dynamics of temperature effects, specifically those that characterise extreme heat or cold. The focus of the application in this paper will be heat wave events in the greater Tokyo metropolitan area which is known to be prone to some of the most severe heat wave events that have one of the largest population exposures due to high density living in Tokyo city.

        Speaker: François Septier (Université Bretagne Sud (UBS))
    • Coffee Break Maison de la Vie Etudiante (MVE)

      Maison de la Vie Etudiante (MVE)

      Just in front of the Amphi Hennequin
    • Session: Statistical Modeling Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Daisuke Murakami (The Institute of Statistical Mathematics (ISM))
      • 3
        Sub-model aggregation for scalable spatially varying coefficient modeling

        This study aggregates/combines global and local sub-models to build a fast and flexible spatially varying coefficient model. An approach inspired by the generalized product-of-experts method is used to aggregate the sub-models. The aggregated model has the following properties: (i) computationally efficient; (ii) the marginal likelihood is available in closed-form; (iii) each sub-model can be estimated independently to maximize the likelihood. Owing to (ii) and (iii), the proposed model describes complex spatial patterns flexibly and computationally efficiently. The accuracy and computational efficiency of the proposed method are compared with alternatives through Monte Carlo simulation experiments. Then, the method is applied to a regression analysis of residential land prices in Japan.

        Speaker: Prof. Daisuke Murakami (The Institute of Statistical Mathematics (ISM))
    • Session: Statistical Modeling Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Keisuke Yano (The Institute of Statistical Mathematics (ISM))
      • 4
        Minimum information dependence modeling:

        We propose a method to construct a joint statistical model for mixed-domain data to analyze their dependence. The model is characterized by two orthogonal parameters: the dependence parameter and the marginal parameter. To estimate the dependence parameter, a conditional inference together with a sampling procedure is proposed and is shown to provide a consistent estimator. Illustrative examples of data analyses involving penguins and earthquakes are presented.

        Speaker: Prof. Keisuke Yano (The Institute of Statistical Mathematics (ISM))
    • Lunch Break Maison de la Vie Etudiante

      Maison de la Vie Etudiante

      Just in front of the Amphi Hennequin
    • Session : Machine Learning Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Arnaud Guillin (Laboratoire de Mathémariques Blaise Pascal (LMBP))
      • 5
        Limit theorems for neural networks modeling

        TBA

        Speaker: Prof. Arnaud Guillin (Laboratoire de Mathémariques Blaise Pascal (LMBP))
    • Session : Machine Learning Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Véronique Maume-Deschamps (Institut Camille Jordan, Université Lyon 1)
      • 6
        Goal Oriented random forest

        TBA

        Speaker: Prof. Véronique Maume-Deschamps (Institut Camille Jordan, Université Lyon 1)
    • Coffee Break Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
    • Session: Insurance and Risk: TBA Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Gareth Peters (University of California Santa Barbara)
      • 7
        Assessing Portfolio Diversification via Two-Sample Graph Kernel Inference.

        In this work we seek to enhance the frameworks practitioners in asset management and wealth management may adopt to assess how different screening rules may influence the diversification benefits of portfolios. The problem arises naturally in the area of Environmental, Social, and Governance (ESG) based investing practices as practitioners need to select subsets of the total available assets using screening rules of ESG ratings and to compare the subsequent risk and return profile of the portfolios created from different selective portfolios. We propose a novel method to compare the diversification relationships of assets in different portfolios based on a machine learning hypothesis testing framework called the kernel two-sample test. The objective of the test is to determine whether two samples come from the same underlying probability distribution. In the case of asset management, the samples are sequences of graph-valued data points that represent a dynamic portfolio obtained by a certain ESG screening rule and certain portfolio optimization criteria such as the global minimum variance or max Sharpe. The fact that the sample data points are graphs means that one needs graph testing frameworks to compare diversification benefits. The problem is natural for kernel two-sample testing as one can use so-called graph kernels to work with samples of graphs. The objective is then to determine if the two dynamic portfolios have the same generating mechanism. A failure to reject the null hypothesis would indicate that ESG screening does not affect diversification while rejection would indicate that ESG screening does have an effect. The article describes the graph kernel two-sample testing framework, further, it provides a brief overview of different graph kernels. We then demonstrate the power of the graph two-sample testing framework under different realistic scenarios. We finally apply the framework to demonstrate the workflow one can use in asset management to test for structural differences in diversification of portfolios under different ESG screening rules.

        Speakers: Ragnar Levi Gudmundarson (Herriot WATT), Gareth W. Peters (University of California Santa Barbara)
    • Dinner Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
  • Wednesday, 25 October
    • Session: Earthquake Hazard Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Yo Fukushima (Tohuku University - IRIDeS)
      • 8
        Statistical modeling and machine learning studies on the forecast of earthquakes and fault slip

        Earthquakes refer to sudden and spontaneous rupture, or slip, of a geologic fault. Although the physics of fault slips are not fully understood, the governing equations of fault slips are used along with data assimilation methods to make forecast of future fault slip behavior. On the other hand, by focusing on the location and timing of occurrences, earthquakes have also been regarded and modeled as a point process. In both kinds of studies – that focuses on the fault slip evolution and that focuses on the earthquake occurrence patterns – statistical modeling and machine learning studies are changing the research landscape. Recently, science findings have been increasingly used in society, in particular for providing alert/advisory information related to future earthquake occurrence. It means that methodological developments having a larger predictive ability may change how society prepares for future earthquakes. In the talk, I will review the recent advances of statistical modeling and machine learning studies in earthquake science after explaining the fundamental backgrounds, and subsequently on how the research advances can contribute to society for earthquake disaster mitigation.

        Speaker: Yo Fukushima (Tohuku University - IRIDeS)
    • Session: Earthquake Hazard: TBA Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Ryo Nakagawa (Tohuku University - IRIDeS)
      • 9
        Estimation of the fault slip of short-term slow slip events from GNSS data using deep learning

        Since dense geodetic and seismic networks reveal the presence of slow earthquakes and the close relationship between regular and slow earthquakes, many studies have focused on the detection of slow earthquakes and their source characterization. Global Navigation Satellite System (GNSS) continuously monitors ground deformation and is one of the most common tools used to detect slow slip events (SSEs), a kind of slow earthquake. However, GNSS data sometimes needs manual preprocessing due to its low signal-to-noise ratio. Furthermore, automated analysis methods are becoming increasingly important in today’s world of huge data volumes. Deep-learning approaches, especially convolutional neural networks (CNN), have largely contributed automation process to deal with big data. These brand-new technologies have brought significant breakthroughs into many fields including seismology (e.g., Yano et al., 2021) and geodesy (e.g., Rouet-Leduc et al., 2021).
        In this study, we aim to develop a deep-learning method to monitor spatio-temporal evolutions of short-term SSEs based on a dense GNSS network. We theoretically create two types of horizontal deformation data including synthetic noise by assuming 272 subfaults in western Shikoku, southwest Japan; 16 subfaults along the strike multiplied by 17 subfaults along the dip. One is deformations at 113 GNSS stations, and the second is those at 900 virtual stations which are regularly located over the target area. We tailor two supervised-learning Convolutional Neural Network (CNN) models to estimate the slip area and the slip amount of SSE by learning those deformation images as input data. Nakagawa et al. (2021, Fall Meeting in Geodetic Society of Japan) showed that the model trained with GNSS stations estimated SSEs with 91.8% variance reduction (VR) while the other model achieved 98.3% VR. We concluded that this difference in estimation accuracy is contributed to the dissimilarity between input deformation images. Therefore, we newly implement Model-supervised Interpolation (MSI) approach to overcome this problem. MSI successfully reproduces the deformations at 900 virtual stations only from the deformations at GNSS stations with 97.4% VR although nearly half of the target area is located on the offshore region. It shows this deep-learning approach is effective to estimate SSEs from GNSS data in this region.

        Speaker: Ryo Nakagawa (Tohuku University - IRIDeS)
    • Session: Earthquake Hazard: TBA Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Yagizalp Okur (Laboratoire de Mathémariques Blaise Pascal (LMBP) and IRIDeS)
      • 10
        Creep characteristics along the central part of the Philippines Fault on Leyte Island from ALOS-2 time-series analysis
        Speaker: Yagizalp Okur (Laboratoire de Mathémariques Blaise Pascal (LMBP) and IRIDeS)
    • Lunch Break Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
    • Session: Volcano Hazard Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Olivier Roche (Laboratoire Magma et Volcans (LMV) - UCA)
      • 11
        Statistical analysis of the runout distance of pyroclastic density currents and implications for emplacement mechanisms

        Pyroclastic density currents (PDCs) are hot mixtures of gas and particles generated by volcanic eruptions. They propagate on the ground at high velocity and can travel distances that are commonly of several tens of kilometers. Understanding the factors that control the long runout distance of PDCs is important for hazard assessment. In this context, we collected data on PDCs in more than 200 publications to create a database. Through statistical analysis of the data, we show that the runout distance of PDCs correlates with the mass eruption rate. A model selection procedure further shows that it is possible to determine two well-defined power law relationships, respectively for fully dilute turbulent currents and for currents with a concentrated base. For dilute currents, the runout distance scales with the ratio of the eruption rate over the particles settling velocity to the power 0.5, in agreement with theory. Runout distances of some concentrated currents are longer than 300 km and outside predictions intervals, and in this regard we argue that these extreme travel distances were reached because the currents were confined in large paleovalleys. We conclude that statistical analysis can help to better understand the mechanisms of complex natural phenomena such as pyroclastic density currents.

        Speaker: Olivier Roche (Laboratoire Magma et Volcans (LMV) - UCA)
    • Session: Volcano Hazard Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Nourddine Azzaoui (Laboratoire de Mathématique Blaise Pascal - UCA)
      • 12
        Data-Driven Framework for Uncovering Hidden Control Strategies in Evolutionary Analysis

        Natural risks are characterized primarily by their uncontrollability, or at least, their difficulty in being controlled. In this instance, the goal is to try to understand the mechanism that causes these hazards as well as the variables that affect the processes' evolutionary behavior. In this talk, we provide a data-driven approach for identifying an evolutionary system's hidden control mechanisms. This approach enables deciphering of the hidden processes that leads to these kinds of hazard circumstances. Model predictive control is extended by using new techniques to determine the best control together with the parameters for evolution in general dynamical systems. This is a major divergence from traditional control approaches, which call for an understanding of the system, the capacity to influence its course, and the controller's approach or parameters.

        Speaker: Nourddine Azzaoui (Laboratoire de Mathématique Blaise Pascal - UCA)
    • Coffee Break Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
    • Session: Statistics and AI Applications Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Cécile Durot (Université Paris Nanterre)
      • 13
        Unlinked or shuffled monotone regression

        In standard regression models, pairs of covariates and response variables are observed. In the more complex case of shuffled regression (on anonymized data), we only observe a sample of covariates on the one hand, and a sample of responses on the other, but we don't know which response corresponds to each covariate. In the even more complex case where responses and covariates are not necessarily measured on the same individuals, both samples of covariates and responses are still observed, but there is not necessarily a link between them. The data are unlinked. This raises the question of whether the link between the two samples in the shuffled case provides any real information compared with the unlinked case, i.e. whether or not the optimal rates of convergence of estimators are identical in the two models. We provide some answers to this question.

        Speaker: Prof. Cécile Durot (Université Paris Nanterre)
    • Session: Statistics and AI Applications Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex
      Convener: Prof. Tomoko Matsui (The Institute of Statistical Mathematics (ISM))
      • 14
        COVID-19 Case Prediction Using Emotion Trends via Twitter Emoji Analysis

        The worldwide COVID-19 pandemic, which began in December 2019 and has lasted for almost 3 years now, has undergone many changes and has changed public perceptions and attitudes. Various systems for predicting the progression of the pandemic have been developed to help assess the risk of COVID-19 spreading. In a case study in Japan, we attempt to determine whether the trend of emotions toward COVID-19 expressed on social media, specifically Twitter, can be used to enhance COVID-19 case prediction system performance. We use emoji as a proxy to shallowly capture the trend in emotion expression on Twitter. Two aspects of emoji are studied: the surface trend in emoji usage by using the tweet count and the structural interaction of emoji by using an anomalous score. Our experimental results show that utilizing emoji improved system performance in the majority of evaluations.

        Speaker: Prof. Tomoko Matsui (The Institute of Statistical Mathematics (ISM))
    • Dinner Amphi Hennequin

      Amphi Hennequin

      Laboratoire de Mathématiques Blaise Pascal

      3, place Vasarely 63178 Aubière cedex