271
With this procedure, it was possible to identify 89
significant wave heights pairs of extreme events,
obtained from ERA5 and PNBOIA, in which at least
one height of the pair exceeded 4.0 m, during the
period from May 2011 to November 2018.
3.2 Adjustment of the ERA5 significant wave height data
to PNBOIA
Adjusting ERA5 significant wave height data to match
PNBOIA records at Santos Buoy yielded an adjusted
coefficient based on the 89 extreme wave events. This
adjustment was made using a linear regression based
on a scatter plot. This coefficient was considered valid
for the entire ERA5 model series (from 1940 to 2023) in
deep water offshore.
The quality of the regression can be assessed by r-
Squared (r² or the coefficient of determination), which
is a statistical measure that determines the proportion
of variance in the dependent variable that can be
explained by the independent variable. In other words,
r-squared shows how well the data fits the regression
model (the goodness of fit).
3.3 Correlation between nearshore and offshore significant
wave heights
Employing 58 extreme wave events from April 2015 to
July 2023, identified by the POT, a linear regression
based on a scatter plot was obtained between the wave
heights recorded in the Santos Pilots ADCP nearshore
and the wave heights of the ERA5 in deep water
offshore.
3.4 Validation of the correlation nearshore x offshore
significant wave heights
The coefficient of correlation obtained according to
explained in item 3.3 was validated with more
nearshore data of 8 extreme wave events comprised
from July 1982 to September 1984 and more 2 extreme
wave events that occurred in May and July 1973.
3.5 Final correlation of the nearshore fairway waves with
the ERA5
A final weighted coefficient, based on correlation
coefficients from items 3.3 and 3.4, was used for the
entire ERA5 series to estimate the nearshore waves.
3.6 Nearshore waves hindcasting from ERA5 with final
correlation coefficient
Two 19-year periods were considered in the
hindcasting analysis, one called remote, from
01/01/1940 to 31/12/1958, and another called present,
from 08/08/2005 to 08/08/2023. The justification for
choosing this 19-year period was that it corresponded
approximately to the lunar nodal period of 18.61 years,
which we have used for the simultaneous study of the
rise in mean sea level on the Brazilian coast. For each
of these very distant periods, the correlation coefficient
previously determined was applied to the ERA5 series
offshore to obtain the estimate of the nearshore wave
height series. In this way, it was possible to perform a
statistical analysis of the frequency of extreme
nearshore waves and compare significant heights and
the number of extreme wave events.
3.7 Extreme value analysis from ordered nearshore
significant wave heights
The Cumulative Distribution Function (CDF) is
considered the most robust relationship for both
interpolation and extrapolation due to its straight-line
structure, as stated in [8]. The probability that any
wave height (H) exceeds a specified value (P(H)) can
be converted into a straight line by transforming the
probability into reduced variates on the ordinates, and
wave heights on the abscissas. The coefficients of the
slope and the intercept of the straight-line relationship
are determined by linear regression analysis. The most
common distributions employed for analysis of
extreme values of ordered statistical in descending
order of significant wave height are the Log-Normal,
Gumbel and Weibull. The reduced variates employed
by these distributions are known as Z, G and W.
For the Log-Normal Distribution, the Z value is
derived from the Standard Normal Probability Tables.
It is calculated using an equation that considers the
mean and standard deviation of ln H (see [8]). The
scatter diagram with linear correlation is given by Z x
ln H. The coefficients of the slope and the intercept of
the straight-line relationship determined by linear
regression analysis are correlated with the mean and
standard deviation of ln H.
For the Gumbel Distribution, the G value is
obtained reducing P(H) according to logarithmic
functions (see [8]). The scatter diagram with linear
correlation is given by G x H. The coefficients of the
slope and the intercept of the straight-line relationship
determined by linear regression analysis are correlated
with G.
For the Weibull Distribution, the W value is
obtained reducing P(H) according to logarithmic
functions (see [8]). The scatter diagram with linear
correlation is given by G x H. The coefficients of the
slope and the intercept of the straight-line relationship
determined by linear regression analysis are correlated
with W. The Weibull Distribution has three
parameters, in addition to slope and intercept
coefficients a third coefficient (α) will require some trial
and error to obtain the best straight line in the scatter
plot.
4 RESULTS
4.1 Adjusting ERA5 wave eights data to match PNBOIA
Fig. 3 shows the linear regression based on a scatter
plot between significant wave height of PNBOIA and
ERA5. It was observed, according to [9], that ERA5
underestimates the actual values, that is necessary to
correct them by multiplying per 1.300. Under these
conditions, a real significant height in deep water of 4.0
m corresponds to ERA5 3.08 m, or in other words,
ERA5 significant heights of 4.0 m will correspond to 5.2
m in real.