Parallel Seismic Data Processing Performance with Cloud-based Storage
April 12, 2025 Β· Declared Dead Β· π Seismological Research Letters
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Sasmita Mohapatra, Weiming Yang, Zhengtang Yang, Chenxiao Wang, Jinxin Ma, Gary L. Pavlis, Yinzhi Wang
arXiv ID
2504.09075
Category
physics.geo-ph
Cross-listed
cs.DC
Citations
0
Venue
Seismological Research Letters
Last Checked
3 months ago
Abstract
This article introduces a general processing framework to effectively utilize waveform data stored on modern cloud platforms. The focus is hybrid processing schemes where a local system drives processing. We show that downloading files and doing all processing locally is problematic even when the local system is a high-performance compute cluster. Benchmark tests with parallel processing show that approach always creates a bottleneck as the volume of data being handled increases with more processes pulling data. We find a hybrid model where processing to reduce the volume of data transferred from the cloud servers to the local system can dramatically improve processing time. Tests implemented with Massively Parallel Analysis System for Seismology (MsPASS) utilizing Amazon Web Service's Lamba service yield throughput comparable to processing day files on a local HPC file system. Given the ongoing migration of seismology data to cloud storage, our results show doing some or all processing on the cloud will be essential for any processing involving large volumes of data.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β physics.geo-ph
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
A Machine-Learning Approach for Earthquake Magnitude Estimation
R.I.P.
π»
Ghosted
Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling
R.I.P.
π»
Ghosted
Bayesian-Deep-Learning Estimation of Earthquake Location from Single-Station Observations
R.I.P.
π»
Ghosted
Convolutional Neural Network for Convective Storm Nowcasting Using 3D Doppler Weather Radar Data
R.I.P.
π»
Ghosted
Seismic data interpolation based on U-net with texture loss
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted