ISSN 1004-4140
    CN 11-3017/P

    基于通道及多头注意力机制的深度学习模型对短周期密集台阵资料进行震相识别

    Seismic Phase Identification Using the Records from a Short Period Dense Seismic Array Based on a Deep Learning Model with Channel and Multi-head Attention Mechanisms

    • 摘要: 震相识别作为地震学中最为基础也最为重要的工作,其准确性对于区域地震活动性的深入分析以及地下结构精细探测至关重要。随着地震观测数据的海量增长与机器学习技术的飞速进步,多种基于机器学习的震相识别方法应运而生,如Eqtransformer与Phasenet等,显著推动了该领域的发展。然而,这些模型在处理短周期密集台阵数据时,常面临小微震漏检率高及部分中强地震波形识别不全的挑战。为应对上述问题,本文提出融合通道注意力机制与多头注意力机制的微震震相识别模型。该模型通过残差连接有效融合粗粒度与细粒度特征,利用通道注意力机制优化特征表示,并借助多头注意力机制增强模型对关键信息的聚焦能力。鉴于训练数据多源自信噪比较高的宽频带地震仪,本文采取策略性数据增强方法,以6∶4的比例构建加噪(包含高斯随机白噪声与STEAD数据集噪声)与未加噪数据集,旨在提升模型在复杂野外噪声环境下的泛化能力。实验结果显示,相较于Eqtransformer与Phasenet等现有模型,本研究提出的模型Deepphase不仅较大程度上提高小微震的识别率,还优化P波与S波到时的识别精度。此外,该模型展现出快速的识别速度,能够在10分钟内完成单台站一个月数据的全面识别,体现其在大数据处理中的高效性。本研究通过引入先进的注意力机制与深度学习架构,成功构建一种高效且精确的微震震相识别模型。该模型在提升小微震识别能力、增强模型泛化性以及加速识别过程方面均表现出色,为微震检测提供有力工具。

       

      Abstract: Phase identification is the most fundamental and important task in seismology and is crucial to in-depth analyses of regional seismic activity and the precise detection of underground structures. The considerable growth in earthquake observation data and the rapid advances in machine learning technology have meant that various machine learning-based seismic phase recognition methods have emerged, such as Eqtransformer and Phasenet, which have significantly promoted the development of this field. However, these models often face challenges when processing short-period dense array data, such as high missed detection rates for small microseismic events and the incomplete waveform recognition of medium-to-strong earthquakes. To address these issues, this study developed a microseismic phase recognition model that uses channel and multi-head attention mechanisms. It effectively integrates coarse- and fine-grained features through residual connections, optimizes feature representation using a channel attention mechanism, and enhances the ability to focus on key information through a multihead attention mechanism. Training data are derived from multiple sources and have high confidence-to-noise ratios; therefore this study adopted a strategic data augmentation method to construct noisy (including Gaussian random white noise and STEAD dataset noise) and denoised datasets in a 6:4 ratio to improve the generalization ability of the model in complex field noise environments. The experimental results show that compared to existing models, such as Eqtransformer and Phasenet, the proposed model significantly improves the recognition rate for small microseismic events and optimizes the recognition accuracy of P-wave and S-wave arrival times. In addition, the model has a fast recognition speed and can complete the comprehensive recognition of one month of data from a single station within ten minutes, which demonstrates that it can efficiently process big data. This study successfully constructed an efficient and accurate microseismic phase recognition model by introducing advanced attention mechanisms and deep-learning architectures. This model considerably improves the ability to identify small and micro earthquakes, enhances model generalization, accelerates the recognition process, and provides a powerful tool for micro earthquake detection.

       

    /

    返回文章
    返回