key: cord-0167467-86xdi3i8 authors: Singh, Manmeet; Kumar, Bipin; Chattopadhyay, Rajib; Amarjyothi, K; Sutar, Anup K; Roy, Sukanta; Rao, Suryachandra A; Nanjundiah, Ravi S. title: Machine learning for Earth System Science (ESS): A survey, status and future directions for South Asia date: 2021-12-24 journal: nan DOI: nan sha: b4fb922a3e2c2a2638823e774c362efa9e2d9f37 doc_id: 167467 cord_uid: 86xdi3i8 This survey focuses on the current problems in Earth systems science where machine learning algorithms can be applied. It provides an overview of previous work, ongoing work at the Ministry of Earth Sciences, Gov. of India, and future applications of ML algorithms to some significant earth science problems. We provide a comparison of previous work with this survey, a mind map of multidimensional areas related to machine learning and a Gartner's hype cycle for machine learning in Earth system science (ESS). We mainly focus on the critical components in Earth Sciences, including atmospheric, Ocean, Seismology, and biosphere, and cover AI/ML applications to statistical downscaling and forecasting problems. . The recent increase in computational power has given rise to application to novel techniques. In the last few decades in conjunction with increasing computational power we notice significant improvement in forecasts at various scales using numerical techniques. The advent of satellites, modern instruments and advanced global/regional modelling capabilities has helped in amassing large amounts of data surpassing petabytes per day. Hence the need of the hour is to exploit this data innovatively. These datasets have been collected using sensors that monitor magnitudes of states, fluxes, and more intensive or time/space-integrated variables. The Earth system data exemplify all four of the "Four V's of Big Data" concerning volume, velocity, variety, and veracity. Looking at the big picture shows that our capacity to gather and store data vastly outpaces our ability to access it, much alone comprehend it meaningfully. The power to make accurate predictions has not risen in parallel with data abundance. We will need to undertake two significant endeavors to maximize the wealth of Earth system data growth and diversity, which are (1) identifying and utilizing data insights, and (2) developing predictive models that can discover previously unknown laws of nature while still honoring our evolving understanding of the laws of nature. Advances in computing capacity and enhanced data availability provide exceptional new prospects. For example, machine learning and artificial intelligence technologies are now accessible, but they require additional development and adaptation to geoscientific study. In both spatial and temporal domains, new methods present new opportunities, new problems, and ethical demands for contemporary study fields in ESS 1 . Machine learning algorithms have grown with data availability, successfully applying to many geoscientific processing schemes, including the atmosphere, the land surface, and the ocean. Land cover and cloud classifications have been precursors to the GIS field since the resurgence of neural networks, thanks to the availability of very high-resolution satellite data. The majority of machine learning research in methodology (for example, kernel techniques or random forests) has since been applied to geoscience and remote sensing issues. That is frequently the case with new data appropriate for the specific approaches. In other words, machine learning has emerged as a versatile method for geoscientific data categorization. For example, change and anomaly detection issues may now be tackled using these techniques. In addition, deep learning has been employed in geoscience in the past several years to exploit better spatial and temporal patterns in the data, aspects that standard machine learning often struggles. Machine learning finds applications throughout all of ESS, and more algorithms are being incorporated into operational systems. New patterns are being discovered, in addition to the knowledge used to assess the many Earth system models. Machine learning aims to uncover the transformation functions which map to the fields of high interest such as precipitation, temperature, and others. The developments in the physical sciences associated with simple statistical methodologies have left a large gray area in uncovering the relationships leading to complex, non-linear variables. There is a need to dedicate resources to using advanced machine learning based tools to decipher the links to physical fields which are still out of our reach and improve their predictability. Developments in deep learning, deep reinforcement learning, transformers, non-linear science and recent advances in interpretable machine learning are the areas that can help to solve crucial research problems in ESS. Recognizing this need to effectively utilize this large data effectively, the Ministry of Earth Sciences recently setup a virtual centre for Artificial Intelligence and Machine Learning devoted to Earth Sciences and anchored at the Indian Institute of Meteorology, Pune. Previous surveys on the use of machine learning in ESS are summarized in Table 1 . These reviews have mostly focused on the broad applications of machine learning in Earth science problems. Rolnick et al. 2019 2 focus is the most elaborated review yet on the topic, but they focused in general on the solutions to tackle the problems associated with climate change using machine learning. Others focused more on hydrology or remote sensing problems, with Reichstein et al. 2019 3 being the nearest survey to the one we have worked upon in this paper. • The previous surveys have only addressed the problems within ESS in general. There is a need for a review paper focusing on the studies and issues addressing the South Asian region. For example, the Indian monsoon is one of the most complex climate phenomena whose mystery is yet not fully solved. It requires special focus and attention to address the challenges in accurately predicting the various spatiotemporal scales of the monsoon. • The studies summarized in Table 1 have not considered the latest state-of-the-art algorithms such as the attention-based Transformers and the Generative Adversarial Networks. The advancements brought by these models in the computer vision and natural language processing community make them excellent candidates to be explored in the domain of ESS. • This review outlines all the previous review papers on the subject, delineates the tools required, the material required by anyone to gain hands-on experience in machine learning and can be used to further the applications of machine learning in ESS. This section discusses the algorithms, data, problems, tools, educational material, feature engineering, and the emerging areas related to machine learning in Earth sciences. These have been summarized in the mind map, depicted in Figure 1 , taking the case of weather and climate sciences as an example. Various algorithms which have shown remarkable performance in computer vision, natural language processing, and reinforcement learning etc can be directly applied to the problems of ESS. For example, the super-resolution methodology (SRCNN, DeepSD) developed by Dong et al. (2015) 8 to enhance the resolution of image datasets has been used to downscale the precipitation datasets from coarser resolution to a high resolution 9, 10 . Seasonable forecast of various aspects of monsoon has been studied using single and stacked encoder-based techniques 11, 12 . Prediction of solar irradiance using CNN with added attention has been used 12 . Recent advances in the computer vision community show that algorithms such as SRGAN, LapSRN, FSRGAN, and UNET outperform the standard SRCNN. Long short-term memory (LSTM) networks, sequence to sequence networks, and the recent attention-based Transformer models have improved the accuracies in natural language processing. Some of these algorithms have also been used or can be applied to the time series forecasting problems in ESS. For example a survey on these applications can be found in Lim and co-authors 13 . Weather and climate data are so massive that it is not even encountered by the community working on big data. The spatio-temporal nature of the datasets, i.e., three-dimensional fields at each temporal dimension, makes it a complex problem to solve. The patterns in this four-dimensional data cannot be deciphered manually, and machine learning offers the perfect opportunity. Models that have shown good performance on video datasets such as ConvLSTM can be used to build large-scale deep learning-based systems which can predict the information in high spatial and temporal resolutions 14, 15 . Sequence-to-sequence and LSTM has been used to predict and forecast active-break cycles of Indian monsoon 16 . Before starting any analysis, traditional algorithms such as random forests, support vector machines and multivariate linear regression should be the first goto methods. EnhanceNeT and PSPNet are algorithms which can be used for classification of objects in images and spatially locating them. They have shown excellent results in computer vision applications. They can be used for problems such as the identification of floods from satellite imagery. The understanding of ESS datasets is most important while developing machine learning models. These datasets primarily come in three classes, viz the observational data, reanalysis product combination of model data and observation created in a consistent manner) , dynamical model simulated outputs (such as climate change data from models) . For the South Asian domain, longperiod ground-based observations from the India Meteorological Department (IMD) are available. These datasets can now be obtained from https://dsp.imdpune.gov.in. Satellite-based products are available from the Tropical Rainfall Measuring Mission (TRMM), Landsat, Sentinel and MODIS. Reanalysis products are gridded products and are very useful for the fields which are/cannot be measure directly by instruments. They offer insights into the information which is nearest to reality. Various reanalysis products are available for the South Asian region, such as MERRA-2 reanalysis and (vi) JRA55 reanalysis. As regards model products are considered, various model products such as TIGGE, CMIP5/CMIP6, and Seasonal to sub-seasonal (S2S) hindcasts are available. The model outputs are based on the integrations of partial differential equations based on dynamical systems. Machine learning offers an innovative methodology to improve these dynamical model estimates by combining them with observed or reanalysis products. The archive of seismic waveform data, Global Positioning System (GPS) data, oceanographic and other geoscience datasets in the country are increasing exponentially every year, calling for fast and efficient processing and dissemination of information to the public service systems. South Asia is home to more than two billion inhabitants and is heavily dependent on the natural climate variability for livelihood. For example, the Indian monsoon feeds agricultural lands over the region, thus directly impacting the region's economic well-being. Monsoon is a complex multiscale problem and is nonlinear and hence linear methods cannot unravel the real processes especially the feedback processes leading to its variability. Forecasts at various temporal scales such as short to medium range (1-10 days), extended range (2-3 weeks), seasonal scale (for coming season) and climate scale (100s year) are important for planning hydrological resources over the region. It has been known that the crop yields are dependent on the meteorological variables; machine learning can be used to accurately forecast the spatial crop yield a season in advance and economically benefit the society. The demographics in the South Asian region have considerably changed in the past decades and many people now live in the cities. Moreover, the population density in these areas is very high. Hence, locally accurate urban forecasts are need of the hour. These locations are also sources of chemical species which are harmful for the environment and all living beings. Hence air pollution prediction is an important problem. Identifying localities with higher air pollution is also essential for city planning, for example, deciding the number of electric buses by the city authorities. Machine learning-based algorithms can be used to improve the cyclone forecasts of dynamical models. Extreme weather events such as heat waves and cloud bursts are causing havoc in recent times, as it is challenging to predict them accurately. In seismology, AI/ML techniques are being tried out for earthquake detection, phase-picking (measurement of arrival times of distinct seismic phases), event classification, earthquake early warning, ground motion prediction, tomography and earthquake geodesy. The availability of open-source software packages has provided a bridge to the domain experts to avoid reinventing the wheel while applying machine learning to their problems. Python is the most popular language for machine learning, and various libraries such as TensorFlow, PyTorch, Theano, mxnet, OpenCV, Keras, and others are available freely. Visualization software such as TensorBoard and Tableau assist in communicating the results from machine learning models. In addition to the software requirements, deep learning needs Graphical Processing Units (GPUs) to perform the tensor computations in neural networks. Tensor Processing Units (TPUs) are a step ahead of GPUs wherein the neural network is encoded on the chip to perform fast calculations. The TPUs are only available over the cloud, and every individual can't buy a personal GPU for deep learning. Hence, free and paid cloud computing services, such as Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform (GCP), Paperspace, Digital Ocean, Google Earth Engine provide an option to build machines over the cloud to perform deep learning and conduct analysis in climate science 17 . A step further, the concept of Jupyter notebooks as a service has become popular and there are several frees as well as paid vendors providing notebook as a service. Notable amongst them include the free services provided by Kaggle, Google Colab and others. More cloud vendors can be found at https://github.com/binga/cloud-gpus, https://github.com/zszazi/Deep-learning-in-cloud, https://github.com/discdiver/deep-learning-cloud-providers/blob/master/list.md and others actively maintained. "Docker containers" have also become an important part of the ecosystem helping deploy end-to-end packages for deep learning. A key component in the machine learning cycle is the educational resources to build knowledge and apply it to Earth and climate science. The avenues to learn data science and hence use machine learning for Earth science applications are the Coursera specializations, courses, professional certificates, Udacity nanodegrees, Udemy courses and other free and paid material available as MOOCs. The Development of Skilled Manpower in Earth System Sciences (DESK) at the Ministry of Earth Sciences (MoES), Government of India regularly holds training programs to train young researchers on machine learning applications in in Earth sciences. One such training workshop was conducted in 2021 and the details can be found at https://www.youtube.com/watch?v=2cQmhDgFinQ&list=PLgQCKqNw6z_CMKiKoWMAukvo 8vyEe1KLb After the weather/hydrological forecasts have been generated, they have to be used to take decisions for the benefit of society. Deep reinforcement learning is an excellent methodology for this purpose. State of the art algorithms such as deep q networks, vanilla policy gradient, trust region policy optimization, proximal policy optimization, deep deterministic policy gradient (DDPG), soft actor critic, twin delayed DDPG and others can be used to train agents which can guide in decision making. The most crucial aspect of deep reinforcement learning is the design of the environment, action(s), and reward(s). The decision-making can be used for disaster preparedness/mitigation, hydrological planning, and other associated tasks. Feature engineering is the generation of meaningful predictors or parameters to improve the performance of a machine learning model. It is performed after cleaning the data and preparing it in a format that can train statistical models. It has been noted that removing redundant variables improves the performance of machine learning systems. Various methods can be used to find the most valuable predictors some of them are: Principal Component Analysis (PCA), Empirical Orthogonal Functions (EOF), and Independent Component Analysis (ICA). Binning, counts, transforms or filtering can be used to extract the predictive signal from data to improve the models. Unsupervised learning techniques such as autoencoder can also assists in finding valuable predictors from raw datasets. The deep learning-based models are, however, coded for imagebased input datasets. To overcome this limitation, strategies such as transforming the spherical global data to a cubed sphere or tangent planes mapping can effectively reduce spherical distortions in the data. While the previous decade has seen the hype of deep learning overshadow other machine learning methodologies, there are numerous emerging and innovative machine learning methods which can be used for ESS. Graph machine learning is training neural networks on graphs and is becoming increasingly popular. Complex networks and recurrence plots come in the category of nonlinear methodologies and come in handy for specific applications. One major concern while using machine learning for the physical sciences is that these models are known as black box models. Interpretable machine learning aims to address this concern and analysis of deep learning model weights reveal the patterns learned. Ongoing active research is happening in this area, and it is crucial for the increasing acceptability of deep learning models at the production scale in ESS. The emerging fields of augmented reality, virtual reality, improved remote sensing measurements, crowd-sourcing and drone technology offer excellent potential to advance the observation data collection and improve machine learning models. The A/ML algorithms have vast applications in the Earth Science problems. Figure 2 depicts few applications in the areas, including atmosphere/biosphere, seismology, and ocean. The precipitation forecasting can include data from short-range, medium-range and extended-range forecasting. Downscaling of data is necessary to get a local projection of the information. The present-day models and observations generated from weather stations (or other instruments) are available at coarser resolution and are irregularly spaced, which may often provide misrepresentation (or absence) of precipitation, temperature, or other variables at local levels. Downscaling of Indian Summer Monsoon (ISM) rainfall is a difficult task as it involves a multi-scale spatiotemporal dynamical process with significant variance 18 . Further, regional variations of ISM rainfall are often quite large varying from a few millimeters to thousands of millimeters within a few hundred kilometers. The ISM rainfall can be classified into different coherently fluctuating zones, which may be linked to complex multi-scale processes [19] [20] [21] . The statistical downscaling is a low-cost method to obtain information at the local scale and provide it to stockholders. Artificial Intelligence (AI) and Machine Learning (ML) techniques are being used for statistical downscaling 8, 22 . Recently, development in the Single Image Super-Resolution (SR) using Deep Learning proved to be one of the best methods used for this purpose [8] [9] [10] . Another method that showed promising results in the statistical downscaling is the ConvLSTM method documented by Harilal and coauthors 23 . The growing volume of seismological and other geoscience-related datasets acquired from surface and borehole studies, require efficient techniques for analysis and trend recognition to extract valuable signals. AI/ML tools have been applied in different fields in seismology, from event identification to earthquake prediction, with varying degrees of success [24] [25] [26] [27] [28] [29] The case studies also bring out the need for further research and development to refine the existing techniques and develop new tools that could be utilised in the processing and analyses of large datasets and identification of different geophysical signals. The use of AI/ML techniques in geoscience / seismological could be employed gainfully to analyse other seismological datasets that are routinely acquired by MoES and its affiliated institutions. Identifying seismic phases, accurately, is one of the primary requirements for seismological data analysis towards the determination of earthquake source parameters. ML is planned to be used in identifying different seismic phases in the data. In many earthquake detection algorithms, STA/LTA criteria are being used to detect possible arrival times of P and S waves 30 . Later, matched filtering or template matching technique was used for event detection. In this method, waveforms of known events are used as templates to scan through continuous waveforms to detect new events 31 . Recently, ML is utilized to improve earthquake detection and phase picking capabilities 25, 32 . Fingerprinting and Similarity Thresholding (FAST) is the latest algorithm using ML techniques to identify earthquakes without prior knowledge of seismicity. Being computationally more efficient than template matching FAST would facilitate automated processing of large voluminous datasets. Similarly, the generalized phase detection (GPD) algorithm searches for near identical waveforms from millions of seismograms, which is used to classify windowed data as P, S or noise. GPD can be applied to datasets that were not just encompassed by training sets but can also be applied to difficult cases such as clipped seismograms. Kong et al., (2018) 33 used Neural Networks (NN) for training to pick P-wave onset and to detect P-wave polarity. ML techniques have important applications in the detection of small magnitude local earthquakes in areas which are characterized by sparsity of receivers. ML/AI algorithms may play an important role in the identification of events and in locating earthquakes with recordings of the events at fewer stations 33 . Other applications in earth sciences such as hydrology, AI/ML can be used for estimating and predicting stream flow in ungauged basins [35] [36] [37] . Currently, the global highest resolution ensemble predication system at ~12.5 km horizontal resolution (with 21 members) is being used for providing 10 days probabilistic forecast based on the Global Ensemble Forecast System (GEFS@T1534) India Meteorological Department. The high resolution GEFS has been implemented by IITM for operational application since June 2018. While the deterministic GFS model 38 at 12.5 km provides a better skill up to ~5 days compared to the earlier coarser resolution (~25 km resolution GFST574) 39 , the ensemble prediction system has shown much better skill than the control member (the deterministic GFS model) particularly for predicting extreme rainfall events [40] [41] . The model forecast inaccuracies mainly arise from initial conditions and improper physical parameterizations. The uncertainties of initial conditions are largely resolved by the perturbed initial conditions in the ensemble prediction system, however, the uncertainty arising from deterministic closures of the physical parameterization still adds much errors due to unrealistic constraints, namely the quasi-equilibrium 42 . Under the AI/ML/Dl paradigm, the use of sub-grid scale tendencies generated by the cloud resolving models within each climate model grid as the input of a deep learning model for training to target the heat and moisture trained tendencies hold promise in improving the model fidelity [43] [44] [45] . AI/ML methods recently find applications in the climate forecast models. Two basic applications show promise for near future applications. The first one is the bias correction and improvement of numerical model forecast. The other one is the methods attempting the sub seasonal low frequency forecast. The bias correction and model post processing applications are of much use to the stakeholders using climate forecast. The climate forecasts from the dynamical models show a lot of bias when forecast is considered over scales lower than the balanced flow, mainly arising due to unknown physics or unresolved dynamics. In situations where enough observation are available over a location, some of the systematic errors arising due to unresolved scale dynamics or physics can be corrected 46 . Subseasonal forecasting using machine learning methods are recently under active research 12,47-49 Seasonal forecasting is one of the tough problems in forecasting. As pointed out by Lorenz (1963) 50 the weather forecasts are strongly dependent on initial conditions (today's weather determines tomorrow's weather) and in contrast climate projections/decadal predictions (an average of weather for a few decades) do not suffer from the initial conditions, however, depends on boundary conditions. When we try to make seasonal forecasts, the distinction is rather blurred and the seasonal forecasts still depend on intial conditions 51 . Chattopadhyay et al (2016) 51 have shown that models hindcasts initialized with February initial conditions exhibit better prediction skill for Indian Summer Monsoon rainfall (ISMR). Further complexities such as resolving ocean processes also become important at seasonal scale. Hence, extracting predictive information (which changes from event to event) across both space and time scales is very important to make significant progress in seasonal forecasts 52 . Therefore, use of AI/ML methods for making improved seasonal forecasts is imperative and research community started using these methods widely in seasonal forecasts [53] [54] [55] and some researchers even believe that AI/ML methods can outperform conventional prediction systems [54] [55] . One of the long-standing seasonal prediction problems is prediction of Indian summer monsoon rainfall. H.F. Blandford started seasonal forecasting of Indian summer monsoon using empirical methods in 1886. Since then, numerous attempts were made to predict seasonal mean monsoon over India using both empirical models and dynamical models (Atmosphere and coupled oceanatmosphere models, see review article of Rao et al., (2019) 39 for more details). Empirical models showed very high skills (>0.9) during development stages and during actual operational phase they showed weak skills (<0.5). On the other hand, dynamical models showed moderate skill during hindcast period as well as during operational forecast 39 . The major reason for empirical models' failure to provide high skills during operational phase is that the relationship between predictor and predictands undergo secular changes from the time the model was developed to the phase when it is made operational. To avoid such a situation AI/ML models can be used efficiently to identify new predictors 53 . Using autoencoders Saha et al (2021) 53 have developed an AI/ML model to predict IMSR with two months lead time and absolute mean error less than 3%. On the other hand, the dynamical models exhibit systematic biases in precipitation (See Rao et al., 2019 39 ) and basically arise due to parametrization schemes used in these models and therefore underestimate extremes. To avoid such systematic problems AI/ML models will become handy. Dynamical models work on the principle of solving partial differential equations over the area of interest with the necessary initial and boundary conditions. They consist of various components such as atmosphere, ocean, land surface and others. The correct representation of physical processes in the numerical models is highly essential for accurate simulations of the coupled climate system. For example, various researchers have tried to understand the relationship of global and regional teleconnections such as ENSO 56 65 discuss the role of machine learning in estimating the cloud microphysics. The uncertainties in the simulation of Indian monsoon arise from the missing or erroneous physics in the dynamical systems. Machine learning to improve the physical processes can lead to cascading returns by improving the hydrological outputs from the numerical weather prediction models [66] [67] [68] [69] [70] . The need for a high-resolution early warning system with reliable nowcasts in the regions of steep topography and urban areas during severe weather is highly essential. Traditionally, Nowcasting is served by carrying out extrapolation, probabilistic Nowcasting 71 , semi-Lagrangian advection scheme 72 Satellite remote sensing and NWP groups are ripe for rapid advancement in their application of machine learning. The NWP relies heavily on the integration of fields generated by satellites and other remote sensing devices. Gaps, both spatially and temporally, are a common occurrence in such data. The existence of gaps, both spatially and temporally, is a typical issue in such observations. The time series of satellite ocean fields are constructed using an ensemble of NNs with varying weights 76 , and a deep learning method to reconstruct the optical images 77 . When modeling, deploying systems, and even issuing warnings, the ML method can give a post-forecast correction to account for uncertainties after learning from all previous failures 78 . In this study, a review of the applications of machine learning in ESS has been made. The future directions focused on solutions for the South Asian region have been summarized as a Gartner's curve in Figure 3 . Hard AI problems such as that of earthquake prediction and climate scale predictions. These problems require long lead times of several years to centuries and will take more than a decade of development to be fully solved by machine learning and allied techniques. Such a long development time is expected because of the sparsity of data, for example, over the Himalayan region, under the solid Earth for earthquake prediction. The large uncertainties in dynamical models to project end of century estimates of the climate are also expected to be resolved after extensive research and development. Recent developments in machine learning, particularly in deep learning are expected to lead to transformative improvements in the short to extended range forecast, smart transportation, precision agriculture, policymaking, wind and energy forecasts during this decade. These advancements would be driven by the critical nature of these problems and the availability of high spatiotemporal drones, ground-based observations and satellite datasets. We have discussed various AI/ML techniques used and the ones that have high potential for improving the state-of-the-art in the ESS. An exhaustive literature survey on the applications over South Asian domain, a mind map incorporating all the essential components of data science applications in ESS, and a Gartner's curve for future directions are the main contributions of this study. It can be used as a starting point to understand the existing research problems, applicable algorithms, educational resources required, hardware/software needs and other important aspects essential to work on the applications. As an end goal, this work aims to further the ESS over South Asia using machine learning applications. Opportunities and challenges for machine learning in weather and climate modelling: hard, medium and soft AI Tackling Climate Change with Machine Learning. arXiv.org Deep learning and process understanding for data-driven Earth system science A Transdisciplinary Review of Deep Learning Research and Its Relevance for Water Resources Scientists A comprehensive review of deep learning applications in hydrology and water resources A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community Image Super-Resolution Using Deep Convolutional Networks Deep learning-based downscaling of summer monsoon rainfall data over Indian region Generating High Resolution Climate Change Projections through Single Image Super-Resolution. arXiv.org Autoencoder-based identification of predictors of Indian monsoon Prediction of the ENSO and EQUINOO indices during June-September using a deep learning method Time-series forecasting with deep learning: a survey Deep Learning Based Forecasting of Indian Summer Monsoon Rainfall Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting Deep Learning Based LSTM and SeqToSeq Models to Detect Monsoon Spells of India Quantifying COVID-19 enforced global changes in atmospheric pollutants using cloud computing based remote sensing The Multiscale Global Monsoon System: Research and Prediction Challenges in Weather and Climate Coherent rainfall zones of the Indian region The Indian Monsoon and Its Variability On the spatial coherence of sub-seasonal to seasonal Indian rainfall anomalies Downscaling of precipitation for climate change scenarios: A support vector machine approach Augmented Convolutional LSTMs for Generation of High-Resolution Climate Change Projections Machine learning for data-driven discovery in solid Earth geoscience & Denolle Marine. Convolutional neural network for earthquake detection and location Continuous chatter of the Cascadia subduction zone revealed by machine learning Supervised machine learning on a network scale: application to seismic event classification and detection MyShake: A smartphone seismic network for earthquake early warning and beyond The efficacy of support vector machines (SVM) in robust determination of earthquake early warning magnitudes in central Japan Automatic earthquake recognition and timing from single traces The detection of low magnitude seismic events using array-based waveform correlation Application of real time recurrent neural network for detection of small natural earthquakes in Poland Deep learning for seismic phase detection and picking in the aftershock zone of 2008 Mw7.9 Wenchuan Earthquake Advances in ungauged streamflow prediction using artificial neural networks Flood hazard mapping methods: A review Intensification of hydrological drought due to human activity in the middle reaches of the Yangtze River, China. Science of The Total Environment 637-638 Performance of a very high-resolution global forecast system model (GFS T1534) at 12.5 km over the Indian region during the 2016-2017 monsoon seasons Monsoon Mission: A Targeted Activity to Improve Monsoon Prediction across Scales Spatio-temporal variability in the stratiform/convective rainfall contribution to the summer monsoon rainfall in India Unraveling the Mechanism of Extreme (More than 30 Sigma) Precipitation during August 2018 and 2019 over Kerala Simulations of Monsoon Intraseasonal Oscillation Using Climate Forecast System Version 2: Insight for Horizontal Resolution and Moist Processes Parameterization Zero-Shot Learning of Aerosol Optical Properties with Graph Neural Networks Deep learning to represent subgrid processes in climate models Combining data assimilation and machine learning to infer unresolved scale parametrization Objective Identification of Nonlinear Convectively Coupled Phases of Monsoon Intraseasonal Oscillation: Implications for Prediction Predicting the MJO using interpretable machine-learning models A self-organizing map-based ensemble forecast system for extended range prediction of active/break cycles of Indian summer monsoon Tropical Cyclone Track Forecasting Using Fused Deep Learning From Aligned Reanalysis Data Deterministic Nonperiodic Flow Large-scale teleconnection patterns of Indian summer monsoon as revealed by CFSv2 retrospective seasonal forecast runs The potential for skill across the range of the seamless weather-climate prediction problem: a stimulus for our science Prediction of the Indian summer monsoon using a stacked autoencoder and ensemble regression model Deep learning for multi-year ENSO forecasts Using network theory and machine learning to predict El Niño Some aspects of the large scale fluctuations of summer monsoon rainfall over India in relation to fluctuations in the planetary and regional scale circulation parameters El Niño Modoki and its possible teleconnection Individual and Combined Influences of ENSO and the Indian Ocean Dipole on the Indian Summer Monsoon Increasing Trend of Extreme Rain Events Over India in a Warming Environment Pacific decadal oscillation and variability of the Indian summer monsoon rainfall Fingerprint of volcanic forcing on the ENSO-Indian monsoon coupling Understanding the combined effects of global warming and anthropogenic aerosol forcing on the South Asian monsoon Atmospheric Aerosols and Trace Gases Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem Potential and Limitations of Machine Learning for Modeling Warm-Rain Cloud Microphysical Processes Linkage of water vapor distribution in the lower stratosphere to organized Asian summer monsoon convection Learning earth system models from observations: machine learning or data assimilation? Deep learning for post-processing ensemble weather forecasts Physics-informed machine learning: case studies for weather and climate modelling Climbing down Charney's ladder: machine learning and the post-Dennard era of computational climate science Pysteps: an open-source Python library for probabilistic precipitation nowcasting (v1.0) Development of Tracking Technique for the Short Term Rainfall Field Forecasting Machine Learning for Precipitation Nowcasting from Radar Images. arXiv.org A Convection Nowcasting Method Based on Machine Learning Automatic detection and classification of low-level orographic precipitation processes from space-borne radars using machine learning A Machine Learning Approach for Remote Sensing Data Gap-Filling with Open-Source Implementation: An Example Regarding Land Surface Temperature Optical image gap filling using deep convolutional autoencoder from optical and radar images Leveraging Modern Artificial Intelligence for Remote Sensing and NWP: Benefits and Challenges