HPC Tutorials – University of Oklahoma

A structured, step-by-step tutorial series for using High-Performance Computing (HPC) resources at the University of Oklahoma, with a focus on OSCER and DISC.

Workflow Setup on the OU HPC Cluster (Overview)

HPC Tutorial 2024 · University of Oklahoma · Author: Hongyu

An end-to-end overview of working on OU HPC systems, designed for first-time users and researchers transitioning to cluster computing.

Includes a concise quick-start checklist for immediate use.

View tutorial overview →

Linux Command-Line Basics for HPC (Part 0)

HPC Tutorial 2024 · OU OSCER & DISC

Core Linux command-line skills required to work effectively in a shared HPC environment.

Focuses on navigation, file operations, permissions, and process awareness.

Read Part 0: Command-line basics →

Accessing OSCER & DISC Resources (Part 1)

HPC Tutorial 2024 · University of Oklahoma

A practical guide to requesting HPC and GPU access at OU.

Covers account setup, approval workflow, and SSH-based login testing.

Read Part 1: Access & login →

HPC Environment Configuration (Part 2)

HPC Tutorial 2024 · OU OSCER & DISC

Guidance on configuring a reproducible and efficient HPC working environment.

Topics include .bashrc, aliases, environment modules, Slurm basics, Conda, CUDA, and SeisBench.

Read Part 2: Environment setup →

Data Management on OU HPC Systems (Part 3)

HPC Tutorial 2024 · OU OSCER

Best practices for organizing, transferring, and storing data on shared HPC file systems.

Covers SCP, SFTP, storage locations, quotas, and common pitfalls.

Read Part 3: Data management →

Slurm Job Submission & Monitoring (Part 4)

HPC Tutorial 2024 · University of Oklahoma

A hands-on guide to running, monitoring, and troubleshooting jobs using the Slurm workload manager.

Key commands: sbatch, squeue, sacct, sstat.
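
As a rough illustration of what goes into a job submission, the sketch below renders a minimal Slurm batch script from Python. The helper `render_batch_script`, the job name, and the partition name `normal` are placeholders assumed for illustration; partitions and limits on OSCER may differ, so check the site documentation before submitting.

```python
def render_batch_script(job_name, command, partition="normal",
                        ntasks=1, time_limit="00:10:00"):
    """Return the text of a minimal Slurm batch script.

    The partition default is a placeholder, not a confirmed OSCER queue name.
    """
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --partition={partition}",
        f"#SBATCH --ntasks={ntasks}",
        f"#SBATCH --time={time_limit}",
        # %x expands to the job name and %j to the job ID.
        "#SBATCH --output=%x_%j.out",
        "",
        command,
    ]
    return "\n".join(lines) + "\n"


script = render_batch_script("hello", "echo Hello from $(hostname)")
print(script)
```

Saved to a file such as hello.sbatch, the script would be submitted with `sbatch hello.sbatch`, watched in the queue with `squeue -u $USER`, and inspected with `sstat -j <jobid>` while running or `sacct -j <jobid>` after it finishes.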

Read Part 4: Job management →

SeisBench Tutorials

Hands-on tutorials for using SeisBench to load, inspect, and visualize seismic datasets for machine-learning-based seismology.

Loading SeisBench Data & Dataset Statistics Visualization

SeisBench Tutorial 2025 · Author: Hongyu

A practical walkthrough for loading SeisBench datasets and performing basic dataset inspection.

Demonstrates how to visualize dataset composition using bar plots to summarize label and class distributions.

Quick start workflow: load a dataset, inspect labels, and generate a bar plot showing sample counts per class.
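
The counting step of this workflow can be sketched with the standard library alone. The metadata rows and the column name `trace_category` below are hypothetical stand-ins; in SeisBench the per-trace metadata is a pandas DataFrame available as `dataset.metadata`, and the tutorial draws the bars with a plotting library rather than as text.

```python
from collections import Counter

# Hypothetical per-trace metadata rows standing in for a dataset's metadata table.
metadata = [
    {"trace_name": "ev001", "trace_category": "earthquake"},
    {"trace_name": "ev002", "trace_category": "earthquake"},
    {"trace_name": "ns001", "trace_category": "noise"},
    {"trace_name": "ev003", "trace_category": "earthquake"},
    {"trace_name": "ns002", "trace_category": "noise"},
    {"trace_name": "ex001", "trace_category": "explosion"},
]


def count_per_class(rows, column="trace_category"):
    """Count how many samples fall into each class label."""
    return Counter(row[column] for row in rows)


def text_bar_plot(counts, width=30):
    """Render the class counts as a simple text bar chart, largest class first."""
    largest = max(counts.values())
    lines = []
    for label, n in counts.most_common():
        bar = "#" * max(1, round(width * n / largest))
        lines.append(f"{label:<12} {bar} {n}")
    return "\n".join(lines)


counts = count_per_class(metadata)
print(text_bar_plot(counts))
```

A strongly unbalanced chart at this stage is a useful early warning before training a classifier or picker on the dataset.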

View SeisBench tutorial →

Research Code Releases

Open-source scientific software projects for seismic data analysis, deep learning, and geophysical research.

SeisBench with Synthetic Seismogram

Open-source Code Release · PyTorch · SeisBench · Author: Hongyu

Project Overview

A comprehensive deep learning framework for seismic phase picking using PyTorch, SeisBench, and synthetic training data. This pipeline provides end-to-end tools for detecting and picking seismic phase arrivals (P and S waves) from three-component seismograms using neural networks.

Key Features

  • Synthetic Data Generation: Create realistic 3-component synthetic seismograms with configurable parameters for training datasets
  • Data Preprocessing: Standardized conversion to SeisBench format for interoperable workflows and reproducible results
  • Deep Learning Models: Train PhaseNet and other neural network architectures for automated phase picking
  • Model Evaluation: Comprehensive evaluation toolset for prediction analysis and performance metrics
  • Visualization Tools: Advanced plotting utilities for seismogram inspection, model outputs, and results analysis
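
To give a feel for the synthetic data generation step, here is a minimal standard-library sketch that builds a three-component trace with a P and an S arrival and returns the pick labels as sample indices. It is not the repository's generator: the function name, the damped-sinusoid wavelet, and every parameter value are assumptions chosen for illustration.

```python
import math
import random


def synthetic_three_component(n_samples=3000, sampling_rate=100.0,
                              p_time=8.0, s_time=14.0,
                              noise_std=0.05, seed=0):
    """Return (traces, labels) for a toy 3-component seismogram.

    Each phase is a decaying sinusoid that starts at its onset time;
    Gaussian noise is added to every sample.
    """
    rng = random.Random(seed)
    # (onset in s, frequency in Hz, amplitude per component).
    # P is strongest on the vertical, S on the horizontals (illustrative values).
    phases = [
        (p_time, 8.0, {"Z": 1.0, "N": 0.3, "E": 0.3}),
        (s_time, 4.0, {"Z": 0.4, "N": 1.5, "E": 1.2}),
    ]
    traces = {}
    for comp in ("Z", "N", "E"):
        trace = []
        for i in range(n_samples):
            t = i / sampling_rate
            value = rng.gauss(0.0, noise_std)
            for onset, freq, amps in phases:
                if t >= onset:
                    dt = t - onset
                    value += (amps[comp] * math.exp(-1.5 * dt)
                              * math.sin(2.0 * math.pi * freq * dt))
            trace.append(value)
        traces[comp] = trace
    # Pick labels expressed as sample indices, the usual training target.
    labels = {"P": int(round(p_time * sampling_rate)),
              "S": int(round(s_time * sampling_rate))}
    return traces, labels


traces, labels = synthetic_three_component()
print(labels)
```

Because the onsets are known exactly, a generator of this kind yields perfectly labeled training examples, which is the main appeal of synthetic data for phase picking.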

Technology Stack

Python · PyTorch · SeisBench · NumPy · Matplotlib · ObsPy

Repository & Documentation

View on GitHub: seisbench_with_synthetic_seismogram →

Clone: git clone git@github.com:Hy-X/seisbench_with_synthetic_seismogram.git

Getting Started

pip install seisbench torch

Visit the GitHub README for detailed installation and usage instructions. See our SeisBench tutorial → for data loading and visualization examples.

Use Cases

  • Training phase-picking models with synthetic seismic data
  • Benchmarking deep learning architectures for seismic signal detection
  • Research in automated seismic event detection and location
  • Educational resource for machine learning in geophysics

Oklad Annotation Workflow

Open-source Code Release · Python 3.7+ · Annotation Tools · Author: Hongyu

Project Overview

A comprehensive workflow for annotation tasks using the Oklad model. This repository provides optimized, independent scripts and tools for automated annotation processing, enabling efficient and scalable data annotation pipelines.

Key Features

  • Oklad Model Integration: Leverages the Oklad model for high-quality automated annotation processing and semantic understanding
  • Modular Script Architecture: Independent, reusable scripts that can be run separately or chained together for flexible workflows
  • Batch Processing: Efficient handling of large-scale annotation tasks with optimized performance for production environments
  • Data Preprocessing: Built-in tools for data validation, cleaning, and formatting before annotation
  • Result Export: Multiple output formats for seamless integration with downstream analysis tools

Technology Stack

Python 3.7+ · Seisbench · NumPy · Pandas · JSON/YAML

Repository & Documentation

View on GitHub: oklad_annotate_workflow →

Clone: git clone git@github.com:Hy-X/oklad_annotate_workflow.git

Getting Started

Requires Python 3.7 or higher. Clone the repository and follow the setup instructions in the README.

Visit the GitHub README for detailed installation and usage instructions, including example workflows and configuration options.

Use Cases

  • Automated large-scale annotation of research datasets
  • Semantic analysis and text/data annotation pipelines
  • Batch processing of annotation tasks in production environments
  • Integration into data preprocessing workflows for machine learning projects

Python

Oklahoma Earthquake Streamgraph

[Python_Tutorial] @ Hongyu

Explore the seismicity trends in the Oklahoma region from 2010 to 2024 using data from the Oklahoma Geological Survey (OGS) seismic catalog. This Python tutorial demonstrates how to create a streamgraph to visualize earthquake data over time. Learn how to work with seismic datasets and generate insightful visualizations for earthquake analysis.
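
The idea behind a streamgraph is a stacked area chart whose layers are centered around a baseline instead of sitting on zero. The sketch below shows that layout with the standard library only: it bins events by year and magnitude class, then stacks the classes symmetrically around zero. The event list and the magnitude bins are hypothetical and are not taken from the OGS catalog.

```python
from collections import defaultdict

# Hypothetical (year, magnitude) pairs standing in for catalog rows.
events = [
    (2010, 2.1), (2010, 3.2),
    (2011, 2.5), (2011, 2.7), (2011, 3.4), (2011, 4.1),
    (2012, 2.2), (2012, 3.0),
]

CLASSES = ["M<3", "3<=M<4", "M>=4"]


def magnitude_class(mag):
    """Assign a magnitude to one of the illustrative bins."""
    if mag < 3.0:
        return "M<3"
    if mag < 4.0:
        return "3<=M<4"
    return "M>=4"


def yearly_counts(rows):
    """Count events per year and magnitude class, sorted by year."""
    counts = defaultdict(lambda: dict.fromkeys(CLASSES, 0))
    for year, mag in rows:
        counts[year][magnitude_class(mag)] += 1
    return dict(sorted(counts.items()))


def centered_layers(counts):
    """Stack the classes around zero: the baseline is minus half the yearly total."""
    layout = {}
    for year, per_class in counts.items():
        lower = -sum(per_class.values()) / 2.0
        bounds = []
        for name in CLASSES:
            upper = lower + per_class[name]
            bounds.append((name, lower, upper))
            lower = upper
        layout[year] = bounds
    return layout


counts = yearly_counts(events)
layout = centered_layers(counts)
for year, bounds in layout.items():
    print(year, bounds)
```

With Matplotlib, the same centering is available directly through `stackplot(..., baseline="sym")`, or `baseline="wiggle"` for the smoother classic streamgraph shape.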

Oklahoma Events Streamgraph

Download the example text file to follow along and recreate the visualizations.

Click here for detailed content!

Read a CSV File in pandas and Make Scatter Plots

[Python_Tutorial] @ Hongyu

How do you read a CSV file into pandas? This example shows how to load a CSV file and make scatter plots. The examples include a simple scatter plot, a scatter plot with symmetric error bars, a scatter plot with asymmetric error bars, and Plotly scatter plots.

The CSV file and the notebook file used in this example are available for download.

Click here for detailed content!

Set Up Anaconda, Jupyter Notebook, and Modules

[Python_Tutorial] @ Hongyu

How do you set up an Anaconda environment for using Python in earth science? This is the starting point before anything else. This post gives instructions for installing Anaconda, Jupyter Notebook, pandas, and other useful modules.

Click here for content!

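Peking University 2020 Summer School: Course Outline

[PKU Summer School 2020] @ Hongyu

Peking University 2020 Geophysics Summer School

Lab sessions: topics covered in the course

Click here for content!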

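Jupyter Notebook: Introduction and Getting Started

[PKU Summer School 2020] @ Hongyu

What is Jupyter Notebook? Why use it?

How do you install Jupyter Notebook, and what do first-time installation and debugging look like? Why do people like using this platform for debugging?

Click here for content!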

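NumPy: Introduction and Getting Started

[PKU Summer School 2020] @ Hongyu

What is NumPy? Why use it?

How do you install NumPy, and how do you get started? What are the basic ways to use it?

Click here for content!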

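Python: Introduction and Getting Started

[PKU Summer School 2020] @ Hongyu

What is Python? Why use it?

A quick start to Python: what is the basic syntax? What are the basic structures and variables? How do you read the most basic Python code?

Click here for content!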

PyGMT Tutorial: Station Map Visualization

Hands-on tutorial for creating a high-resolution topography map with station locations using PyGMT. Example uses Oklahoma City.

Creating a Station Map with PyGMT

PyGMT Tutorial 2026 · Author: Hongyu

Step-by-step guide for plotting seismic stations on top of a high-resolution topography map using PyGMT. Ideal for visualizing station layouts and regional topography for seismic studies.

Includes Python code examples for loading station coordinates, customizing map appearance, and adding annotations for clarity.

Quick start workflow: import PyGMT, load topography, plot stations, add map features.

View Jupyter Notebook: Station Map Example →

Finding Appalachian Long-Term BH Stations & Plotting with GMT

PyGMT + ObsPy Tutorial 2026 · Author: Hongyu

Guide for querying seismic stations in the Appalachian region using ObsPy, with filters on bounding box, time range, and channel selection (here using BH channels).

Save the station metadata as a CSV file for further analysis, and then plot the stations on a regional map using PyGMT for visualization.

Quick start workflow: define region & time filters, query stations via ObsPy, export CSV, and create a high-resolution station map with PyGMT.
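
The filter-then-export part of this workflow can be sketched offline with the standard library. The station records, the network code `XX`, the bounding box, and the 10-year threshold for "long-term" below are all hypothetical. In the notebook the records come from an ObsPy FDSN client query, where `Client.get_stations` accepts bounds such as `minlatitude` and `maxlatitude`, a time range, and a channel pattern.

```python
import csv
import io
from datetime import date

# Hypothetical station records standing in for the result of a station query.
stations = [
    {"network": "XX", "station": "AAA", "latitude": 37.2, "longitude": -80.4,
     "channel": "BHZ", "start": date(2005, 1, 1), "end": date(2024, 1, 1)},
    {"network": "XX", "station": "BBB", "latitude": 35.6, "longitude": -83.5,
     "channel": "HHZ", "start": date(2005, 1, 1), "end": date(2024, 1, 1)},
    {"network": "XX", "station": "CCC", "latitude": 45.0, "longitude": -93.0,
     "channel": "BHZ", "start": date(2005, 1, 1), "end": date(2024, 1, 1)},
    {"network": "XX", "station": "DDD", "latitude": 38.9, "longitude": -79.1,
     "channel": "BHZ", "start": date(2022, 1, 1), "end": date(2023, 1, 1)},
]


def select_stations(records, box, channel_prefix="BH", min_years=10):
    """Keep stations inside the box, on matching channels, that ran long enough."""
    min_lat, max_lat, min_lon, max_lon = box
    selected = []
    for rec in records:
        in_box = (min_lat <= rec["latitude"] <= max_lat
                  and min_lon <= rec["longitude"] <= max_lon)
        right_channel = rec["channel"].startswith(channel_prefix)
        years = (rec["end"] - rec["start"]).days / 365.25
        if in_box and right_channel and years >= min_years:
            selected.append(rec)
    return selected


def to_csv(records, fields=("network", "station", "latitude",
                            "longitude", "channel")):
    """Serialize the selected stations to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Assumed bounding box (min_lat, max_lat, min_lon, max_lon), for illustration only.
appalachian_box = (33.0, 42.0, -86.0, -75.0)
long_term = select_stations(stations, appalachian_box)
print(to_csv(long_term))
```

Keeping the CSV as an intermediate file means the map can be restyled later without repeating the station query.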

View Jupyter Notebook: Appalachian BH Stations →

Map of Appalachian BH seismic stations
Station map example for Oklahoma City using PyGMT.

CShell

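Common Shell Commands for Seismology

[PKU Summer School 2020] @ Hongyu

Which shell commands are commonly used on servers, and how do you use them?

How do you create, delete, modify, and search? How do you set up redirection? How do you combine multiple commands?

Click here for content!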

Seismicity Plot

[GMT plotting] @ Hongyu

How do you use psxy to make a seismicity plot with GMT? How should the data be organized for psxy?

How do you set up the background basemap and add a color bar to it?

Click here for content!

Location Plot

[GMT plotting] @ Hongyu

How do you use psxy to make a station location plot on top of CRUST 1.0? How should the data be organized for psxy?

How do you set up the background basemap and add a color bar to it?

Click here for content!

SAC

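Common SAC Commands for Seismology

[PKU Summer School 2020] @ Hongyu

Which SAC commands are commonly used on servers, and how do you use them?

How do you view SAC header information? How do you make basic plots? How do you call SAC from scripts? How do you apply basic processing such as filtering?

Click here for content!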

Miscellaneous

Download AGU2020 Video Using youtube-dl

[AGU2020_Notes] @ Hongyu

AGU2020 went online! How can you save the many high-quality scientific talks on the AGU website and watch them later? Try youtube-dl, a free command-line video downloader, to download these videos.

Click here for detailed content!

Illinois State Geological Survey 2022 Summer HVSR

[Summary Report] @ Hongyu

During the summer of 2022, I explored the Horizontal-to-Vertical Spectral Ratio (HVSR) method for estimating sediment thickness in Illinois as part of a project with the Illinois State Geological Survey. This summary includes my work using the IRIS-HVSR package and integrating it with ResShake3D for practical applications in sedimentary basin analysis.

HVSR Analysis

Learn how the HVSR method can be applied effectively in field studies and how to bridge software tools for efficient geological analysis.
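
One common way to turn an HVSR peak frequency into a sediment thickness is the quarter-wavelength relation h = Vs / (4 f0). The sketch below applies it; this is a generic textbook approximation and not necessarily the approach used in the report, and the function name and the average shear-wave velocity value are assumptions for illustration.

```python
def sediment_thickness(f0_hz, vs_m_per_s):
    """Quarter-wavelength estimate of layer thickness in meters.

    f0_hz      : fundamental resonance frequency read from the HVSR peak
    vs_m_per_s : assumed average shear-wave velocity of the sediment layer
    """
    if f0_hz <= 0 or vs_m_per_s <= 0:
        raise ValueError("frequency and velocity must be positive")
    return vs_m_per_s / (4.0 * f0_hz)


# A 2 Hz peak over sediments with an assumed Vs of 400 m/s.
print(sediment_thickness(2.0, 400.0))  # 50.0
```

The estimate is only as good as the assumed velocity, which is why HVSR results are often calibrated against boreholes or other independent thickness data.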

Click here for detailed content!

Contact Me
Feel free to contact me

Address

1301 W. Green St., Urbana, U.S.A.

Phone

+1779-232-3992

Email

hongyux2@illinois.edu