[Paper]Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models
/ 3 min read
Updated:Table of Contents
Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models
Intro
two categories of SWH estimation:
- traditional
- machine learning-based methods
traditional methods:
- weak1: too slow to estimate in time
- weak2: Difficult to adapt to additional impact factors, limits opportunities for performance improvement.
ML-based methods:
Figure omitted during migration:
image-20241208012411958.pngwas referenced locally but the asset file is not present in this repo.
- challenge1: data is too sparse.
- challenge2: the wave variations have strong spatio-temporal correlations, current methods fail to capture this intricate relationships and discern the wave dynamic patterns implied in the data.
LLM?
- few-shot performance
- tackle the issue of data sparsity in SWH estimation
Contributions
- Propose a LLMs with spatio-temporal awareness -> SWH estimation
- invent a prompt templates and prompt embedding module -> To tackle the data sparsity
- A novel spatio-temporal aware encoding moudle -> Enhance the spatio-temporal reasoning capabilities of LLM
Preliminaries
Significant Wave Height: SWH is defined as the average height of the top 1/3 waves
Buoy-based Data(X): X in R(FxMxT), F features collected by M stationary oceanic buoys over T continuous time intervals
Grid-based SWH: GSWH,Y in R(KxJxT), records the average SWH, each grid area over T continuous time intervals, where K and J indicate the rows and columns, respectively.
Problem Definition
Given the buoy-based data X over T continuous time intervals, to estimate the GSWH values Y of T continuous time intervals.
Methodology
Prompt Designing and Encoding
Divide the prompt into five components:
- Actor
- Information
- Target
- Features
- Data
The word embedding layer of LLMs takes input as the structured prompt P = {p1, p2, … , pE }, resulting in P = {p1, p2, . . . , pE }, where E is the length of the prompt, pi ∈ RD is the embedded vector of the corresponding token pi , where D is the dimension of the semantic space of LLMs
Following P-Turning, randomly generate a fixed number of soft-prompt Q = {q1,q2,…,qR},R is the length of soft-prompt, after embedding, obtain Q = {q1,q2,...,qR}.
For Q, we use a mlp(two linear) to Modeling the dependency relationships between different embeddings, obtaining Hq.
Final prompt is concate Hq and P , get Hprompt=[Hq;P] in R: (R+E)XD
Spatio-temporal Encoding
Spatial Encoding
(challenge2): Any maritime coordinate is inherently influenced by the wave movements in its vicinity, but LLMs has limited spatial reasoning capabilities.
The locations of buoys are denoted as G = {(ui, vi)}, i in 1 to M,where u and v indicate the row and column grid indices for the i-th buoy.
Employ Z-order curve, get Z = Zorder(G), Z ∈ R(MxA), M is the number of buoys and A is one-dimensional binary embedding.
Subsequently, Use a linear on Z, get Hloc ∈ R(MXD).
Temporal Ecoding
the observed data X of buoys is segmented into overlapping patches C = {X(i)}, i from 1 to S, X(i) ∈ R(FxMxL), S = (T-L)/W +2, W, L are the stride and the patch length.
Obtain Htemp in R(SxFxMxD) by a linear.
Final Prompt
Hinput = [Hprompt;Hloc;Htemp] ∈ R(IxFxMxD),and I = R + E + 1 + S
LLMs Fine-tunning
Hllm = LLM(Hinput) in R(IxFxMxD), Hpool = AvgPool(Hllm) in (IxMxD)
use a linear and a Reshape Function, Let Hpool transform to Y’s shape
Optimizing with Physical Regularization
Loss: MSE
- l1:SWH values from buoys and the estimated GSWH values.
- l2:data from numerical model and estimated GSWH values.
loss = l1+ axl2
Experiments
Datasets
- the Buoy-based data is from the National Data Buoy Center dataset (NDBC)
- The corresponding numerical model data is from the Global Wave Database (GWD), provided by the Estuary and Coastal Laboratory
Figure omitted during migration:
image-20241208175257340.pngwas referenced locally but the asset file is not present in this repo.
Figure omitted during migration:
image-20241208175306375.pngwas referenced locally but the asset file is not present in this repo.