Automated Classification of Gust Events in The Contiguous USA
Automated Classification of Gust Events in The Contiguous USA
A R T I C L E I N F O A B S T R A C T
Keywords: Extreme gust observations near the ground, which are critical in assessing the impact of wind on structures, are a
Thunderstorms disjoint mixture of events created by different meteorological mechanisms which require separation before
Frontal downbursts assessment. Classification by visual inspection of the meteorological data becomes impractical for very large
Mesoscale gusts
datasets. In this study recent automated approaches that use statistics, pattern recognition and/or neural net
Synoptic-scale gusts
ASOS
works, were calibrated against 9014 visually classified gust events of ≥40kn from 25 locations across the USA
Kernel density estimation over 22 years. The visual classification distinguished between five classes of valid gust event: synoptic scale
K-means storms, deep convection, the forward flank and the rear flank of gust fronts, and downbursts from isolated
Neural network thunderstorms; and between two classes of artefact. The ensemble-averaged timeseries of each class formed a
Shapelet transform distinctive hierarchy. The misclassification rate against the visual classification varied between 24% for the
Fuzzy membership poorest method to 10% for the best method, with most differences between adjacent classes. Extending the most
promising method to include contemporaneous temperature and pressure observations reduced the misclassifi
cation rate to 2.3%. When applied to >107 gust events ≥20kn from 450 locations across the USA, the class
hierarchy remained stable. Implementation is by open-source R scripts.
https://doi.org/10.1016/j.jweia.2023.105330
Received 27 November 2022; Received in revised form 19 January 2023; Accepted 24 January 2023
Available online 9 February 2023
0167-6105/© 2023 Elsevier Ltd. All rights reserved.
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
either order: a) identify all gust events, then classify each event; or b) gust factor events that lacked the expected temperature/pressure
pre-define classes, then search for events of that class. In chronological anomalies as artefacts.
order of publication:- • 2020, Samanta et al. (2020): detected thunderstorm days for
pre-monsoon winds at Kolkata from cloud-base height and potential
• 1992, Twisdale and Vickery (1992) classified thunderstorm gusts as temperature at the 850 hPa level.
the maximum gust observed on “thunderdays” – days where thunder • 2020, Chen and Lombardo (2020): built a convolutional Neural
is seen or heard. Network (NN) that differentiated between thunderstorm and syn
• 1999, Choi (1999) identified thunderstorm gusts by visual search optic gusts in 1-min interval wind observations from the US Auto
through 13 years of anemograph charts and other records. mated Surface Observing System (ASOS). This was trained using 76,
• 2002, Choi and Hidayat (2002) identified thunderstorms in the 20 480 records of 91-min duration, centred on a maximum gust ≥40kn,
highest gusts per year as coinciding with “thunder and rain” to and mutually separated by ≥ 45 min. The thunderstorms in the
calibrate the difference in gust factor between thunderstorm and training set were automatically identified from the thunderstorm flag
monsoon winds in Singapore. in the corresponding METAR, as in (Lombardo et al., 2009). Training
• 2002 Kasperski (2002), for Germany, defined the three classes: the NN took 161 CPU hours. They noted an issue of false positive
depression, gust front, and thunderstorm, for gusts separated by 24h, spikes in the ASOS data that were classified as thunderstorm gusts.
identifying by the peak, mean and gust factor, but found it difficult to • 2022, Arul et al. (2022): Used 240 non-stationary 1h-periods of wind
separate depressions from fronts. speed to train a Stationary Shapelet Transform (SST) – a method
• 2009 Lombardo et al. (2009) used the thunderstorm flag in METAR originally developed (Ye and Keogh, 2011) to find patterns in
to identify hourly maximum gusts from thunderstorms. electro-cardiograms. Here a “shapelet” is a short timeseries that is
• 2014, De Gaetano et al. (2014) adopted a statistical approach, characteristic of an event class. From 168 records of 1-h duration at
evaluating 12 parameters: peak 1s gust; 1-min mean speed; 10-min 10Hz from sonic anemometers, and without presuming any class
mean, and 1-h mean: speed, direction, turbulence intensity, skew structure, an automatic process generated 35,998 candidate shape
ness, and kurtosis; of 2Hz–10Hz sonic anemometer data. They used lets which were winnowed down to 32 “mother shapelets” that best
various combinations of gust factor in a logic tree to identify the represented recurring shapes in the timeseries. On visually classi
same three classes as Kasperski (2002) but could not definitively fying these shapelets, 21 indicated thunderstorm gusts and 11 indi
separate fronts and thunderstorms – “classifying an event not attrib cated synoptic gusts. Transforming the whole timeseries with each
utable to a depression (D) as a thunderstorm (T) or a gust front (F) is the mother shapelet gave a set of coefficients containing peaks, each
ratio G10/G60: when it is less than 0.90, the event is usually a thun corresponding to a section of the timeseries matching a mother
derstorm (T); when it is greater than 0.90, the event is usually a gust front shapelet, so indicating a classified gust event. Although the classifi
(F).” cation is only binary, thunderstorm/non-thunderstorm (T/NT),
• 2019, Huang et al. (2019) used daily outliers in gust, mean temp and multiple mother shapelets were required because of the high vari
mean humidity: Gust >15 m/s, plus outliers of temperature and ability of thunderstorm shapes.
humidity within 20 min of the peak gust, to identify “thermal
ly-developed” wind. 2. Motivation for this study
• 2019, Guerova et al. (2019) used instability indices and integrated
water vapour to predict thunderstorm activity some minutes before In each of the previous studies, summarised above, the methodology
observed lightning flashes. This is just one example of similar pre used one of three basic approaches.
dictive methods using satellite data.
• 2019, Vallis et al. (2019) identified thunderstorm downbursts from a) Conventional statistical moments of the wind speed timeseries
the gust factor and from anomalies in atmospheric pressure, tem (Kasperski, 2002) (Solari et al., 2020).
perature and dewpoint. Through the passage of cold fronts, they b) Pattern recognition applied to the wind speed timeseries (Arul et al.,
differentiate between the expected synoptic changes of temper 2022) (Chen and Lombardo, 2020).
ature/pressure and non-synoptic events. They also classified high
2
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
A Development set of twenty-five ASOS stations were chosen for this 3.3. TD6406
study. Indicated by the crosses in Fig. 1, they are distributed across
CONUS, except for two stations serving Dallas, TX, three serving Chi Relevant data in each TD6406 file are atmospheric pressure from
cago, and three serving Washington, DC, intended to expose any non- three independent sensors, dry bulb temperature, dewpoint and pre
geographical disparities. Five stations were selected for their unreli cipitation. The high precision of the pressure sensors, ±0.0001 in Hg,
ability, e.g., KPRB Paso Robles, CA, to stress-test the classification and their triplication is required for accurate setting of altimeters in
methods. Also shown by grey circles in Fig. 1 are the locations of the landing aircraft. The temperatures are recorded and processed to 0.1 ◦ F
Analysis set of 450 ASOS stations with WMO Class 1 or 2 exposures precision but are reported in integer ◦ F increments. Only the precipita
(Cook, 2021), selected for fuller geographic cover. Relevant metadata tion type codes: NP = no precipitation, R = rain and S = snow, preceded
for the 25 development stations are given in Appendix A. with – or + for light/heavy, are documented (NCDC, 2006b) so the
meanings of the other codes are unknown. Precipitation amount is
measured by a tipping-bucket gauge in 0.01-inch increments which has
1
The TD6405 and TD6406 files from 2000 to June 2022 may be downloaded no practical value at 1-min intervals, only when integrated over longer
by FTP or HTTP. NCEI has transitioned to HTTP only, with observations from periods.
January 2022 onwards available here, updated monthly. TD6406 contains the same kind of errors as TD6405, including the
3
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 1. Locations of ASOS stations used. Crosses: Development set of 25 stations. Circles: Analysis set of 450 stations.
duplication problem but excluding bird-perching, and the same auto calibrating, tuning or training the automated classification method.
mated procedures were used to identify and correct. Curiously, the pe There is consensus in these studies, e.g., (Canepa et al., 2020) (Choi,
riods of duplication and of gaps in the data are often not 1999), that downbursts from thunderstorms and gust fronts are identi
contemporaneous with TD6405 and this leaves some gust events fiable by visual inspection of the meteorological record. The ASOS 1-min
without corroborating TD6406 data. interval 3s gust data permits resolution of gust events persisting for ~3
min, or longer. This includes downbursts from isolated thunderstorms,
4. Independent gust events downbursts embedded in the forward and rear flanks of fronts, and
downbursts able to penetrate through the ABL to the surface from deep
For this study, an “independent gust event” is defined as a 1-h period convection in moderately strong and steady winds. The aim here is to
centred on the maximum gust and separated from other gust events by at identify classes corresponding to the meteorological characteristics
least 30 min. The consensus from the earlier studies is that thunderstorm these physical mechanisms.
downbursts generally last less than 10 min, so this dead-time between The gust events were visually sorted into five valid Classes (1–5),
events complies with the common Wind Engineering rule-of-thumb of 3 plus one (6) to collect the characteristic “spike” artefacts, and another
× timescale for effective statistical independence. Thirty minutes are not (0) to collect events that are unclassifiable due to multiple artefacts or
sufficient for independence of non-convective gusts in synoptic-scale instrumentation malfunctions. These are designated here as.
windstorms, which require a longer separation.
Following the approach of Lombardo and Zickar (2019) and earlier 1. Unclassifiable – comprising events with large data gaps or with non-
studies, any observations that might have come from hurricanes were meteorological artefacts, e.g., instrumentation malfunctions.
removed over the three-day period centred on the arrival of the hurri 2. Synoptic – comprising non-convective gusts generated by the ABL in
cane eye into the relevant State. Owing to their rarity, hurricanes are synoptic-scale weather systems and near-neutral atmospheric sta
assessed differently from the frequent synoptic and convective wind bility. Steady, strong, locally stationary wind with low gust factor;
mechanisms (Vickery et al., 2009). This was relevant only to the coastal little variation in direction and temperature; linear or no trend in
US States along the Gulf and Eastern Seaboard. atmospheric pressure; precipitation absent or continuous.
Following Chen and Lombardo (2020), all gust events >40 kn and 3. Storm-burst – comprising short-duration non-stationary events in
separated by at least 30 min at each Development station were extracted otherwise strong steady winds, sometimes with discernible variation
to give a Development set of 9014 gust events. Initially, a recursive in temperature or pressure. May include downdrafts from deep
search was made of each record to find the next highest gust event, convection which penetrate through the gust structure of the ABL.
which was simple to implement and validate but very slow to execute. 4. Front-down – comprising convective downdrafts in the rear flank of
Execution time was shortened by a factor of ~60 by first extracting the active fronts, where the mean wind speed is decreasing from a higher
much smaller set of local maxima and their times of occurrence and steady value. Mean gust speeds after the peak are lower than before;
searching this. By excluding the period of each gust event from future direction veers or backs; temperature drops rapidly through the
searches, the recursion cycle became progressively quicker. This event; pressure increases temporarily when the downburst is directly
allowed an Analysis set of all gust events above ≥20kn to be extracted over the station, otherwise the variation is that expected for the
from each of the 450 stations in Fig. 1, comprising >107 events. passage of a front.
5. Front-up – comprising convective downdrafts in the forward flank of
5. Datum classification by visual inspection active fronts, where the mean wind speed is increasing to a higher
steady value. Mean gust speeds after the peak are lower than before;
5.1. Gust event classes usually, but not always, associated with a change in mean wind di
rection; otherwise, like Front-down.
All the earlier studies require a datum set of classified events for
4
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
6. Thunderstorm – comprising downbursts from isolated thunderstorms, • There is a sudden drop in the temperature anomaly (red line) coin
often in relatively light winds, where the initial wind speed and di cident with the peak gust that is maintained over the following 30
rection are restored after the event. Temporary sharp rise in gust min (scale ±10 ◦ F).
speed over several minutes; temporary change in gust direction; • The pressure anomaly (blue line) rises through the peak gust and
sudden large drop in temperature; temporary increase of pressure partially recovers after around 7 min (scale ±0.05in Hg).
when the downburst is directly over the station, otherwise slight rise • There are bursts of heavy rain (purple bars) for 11 min following the
due to temperature drop; often a sudden burst of heavy rain. peak gust.
7. Spike – comprising single isolated instrumentation/transmission
spikes, possibly including very short-duration surface-generated The suggested physics for this event is that the downburst was
thermal events like dust-devils. Bird-generated gusts register as sin initiated just upwind of the station, so that the sudden increase in gust
gle spikes with no loss of data if the acoustic path of the sonics is speed and drop in temperature corresponds to the leading gust front and
blocked for less than 30s (Cook, 2022). Associated changes in indi the later increase in pressure corresponds to the center of the downburst
cated direction are not reliable as they come from the same instru as it is advected past the station. This example illustrates the most
ment. There should be little variation in temperature or pressure. critical classification between Class 5 Thunderstorm and Class 6 Spike for
Multiple Spike artefacts within the event period are included in Class extreme-value analysis (EVA) where, ideally, all artefacts are found and
0 to preserve the ensemble-averaged character of Class 6. removed, and no valid thunderstorms are lost in doing this.
5
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 3. Typical (left) and ensemble-averaged (center) visually classified gust events >40kn at the 25 Development stations, and (right) of all gust events >40kn at the
450 Analysis stations classified by TNNDT method.
• (a) and (b) present “gust intensity”, the standard deviation of the 3s mean. This is the reason Kasperski (2002) and De Gaetano et al.
gust divided by its 10-min and 1h datum mean, respectively. (2014) can classify T/NT, but not identify the frontal classes.
Although there is a left-right trend in the mode with Class, there is • (e) presents the speed trend, the change from the mean for the 30 min
considerable overlap between distributions. before the peak to the 30 min after the peak, divided by the mean for
• (c) and (d) present the conventional gust factor, peak divided by the hour centred on the peak. This shows a good separation between
mean, for the same datum means. There is considerable overlap in (c) Class 3: Front-down and Class 4: Front-up, with the other classes
using the 10-min mean, but better separation in (d) using the 1h clustered together in between.
6
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 3. (continued).
• (f) presents the gust “emergence”, defined as the peak gust divided 6.1.2. Metrics for gust direction
by the mean of the 10 next-highest local peaks in the event. Like gust There are often clear variations in wind direction through a gust
factor, it is a measure of sharpness of the central peak. This is a novel event but, as noted earlier, direction is not a reliable discriminator for
metric designed to overcome an issue with gust factor when applied Spike artefacts as it is produced by the same sonic anemometer as the
to the frontal Classes 3 & 4. speed. Fig. 6 presents the KDEs for veer: the difference between the peak
and the incident mean, and trend: the change in 30-min mean from
Although gust factor appears to provide better separation than before to after the event. These KDEs are clustered around the origin and
emergence, the datum mean for Front-down and Front-up lies somewhere provide no useful discrimination, although Classes 3, 4 & 5 show an
between the high/low value before the central peak and the low/high increasing asymmetry in trend. Gomes and Vickery (1978) commented
value afterwards, so is representative of neither. Gust factor therefore in 1978 that the average peak thunderstorm gust direction remains
tends to exaggerate frontal events, biasing the result. The proposed new consistent with the approach mean direction, so this negative result was
emergence metric indicates how far the peak emerges above the enve expected. Both Class 4: Front-up and Class 5: Thunderstorm direction
lope of its peers. Whether these occur before, after or either side of the trends show a slight trend bias which is too small to be useful.
peak is irrelevant, so it treats all the event classes fairly and without bias.
7
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
8
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
9
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 7. Bivariate k-means clusters and kernel density contours of gust “trend” and “emergence” metrics.
Table 2
Confusion matrix for Bivariate k-means method.
Bivariate k-means
Class 0 1 2 3 4 5 6
0 0 13 11 15 6 17 1
1 0 5606 675 83 15 27 0
2 0 44 581 2 2 71 0
3 0 13 0 222 0 17 0
4 0 0 78 0 454 20 0
5 0 36 184 82 38 224 4
6 0 1 11 30 18 122 291
Error 18.2%
Table 3
Confusion matrix for Highest Kernel Density method.
Highest Kernel Density
Class 0 1 2 3 4 5 6
0 0 9 20 18 9 6 1
1 0 5517 750 79 60 0 0
2 0 39 640 2 0 18 1
3 0 14 0 238 0 0 0
4 0 0 2 0 549 1 0
5 0 43 222 76 36 160 31
6 0 0 20 12 9 70 362
Error 17.2% Fig. 8. Look-up table for Highest Kernel Density method.
10
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
11
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 10. Kernel Density Estimates of statistical metrics for temperature anomaly.
Fig. 11. Kernel Density Estimates of statistical metrics for pressure anomaly.
7.2.2. Trivariate Neural Network Direct Traces method (b) Bivariate KDE: Temperature span with gust speed trend and emergence
The Trivariate Neural Network Direct Traces (TNNDT) method ap Vclass 0 1 2 3 4 5 6
pends the temperature and pressure anomaly values to the gust speed to 0 34 6 0 0 0 0 0
give 183 inputs into neuralnet. Adding the temperature and pressure 1 434 6093 1 6 31 166 0
2 63 243 8 0 7 43 0
anomaly traces required normalisations that preserved the relative scale
3 57 50 7 43 0 69 0
of each individual trace yet approximated to the (0,1) range, on average, 4 130 25 38 0 322 25 0
for all traces of the Development set. 5 43 49 23 16 46 391 1
The Classes hierarchy is not a continuous sequence because the speed 6 54 24 50 9 0 9 313
traces for Classes 3 and 4 in Fig. 3 are an antisymmetric pair. It is better Error 19.3%
3
Evaluated by the kde3d function in the R package “misc3d” by Dai Feng and
Luke Tierney.
12
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
13
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 14. Thunderstorm events in the upper tail misclassified by TNNDT as Spike.
valid event.4
A preliminary, simplified, XIMIS (Harris, 2009) analysis for the dis
tribution of annual maximum gust speed was made to assess the impact
on the 50-year return gust speed, V50. The simplifications included
taking no account of serial correlation, nor of asymptotic convergence,
and merging the five valid Classes into the binary T/NT classes. The
analyses were performed for each Development station assuming the
classical Fisher-Tippet type 1 model (Gumbel, 1958) (Castillo, 1988)
was applicable to both T/NT classes:
y = (V − U)/b Equation4
1. The NT classes, Fig. 15, show that the highest 63 values in the Visual
and TNNDT sets are identical, leading to identical estimates of V50 to
0.1kn tolerance. The simplified confidence limits correspond to ±σ
from Equation (3) and, in this case, all values lie within these limits.
2. The T classes analysis, Fig. 16, shows the 3rd-highest Visual value is
missing from the TNNTD values, resulting in a 1.8kn underestimate
of V50.
3. The resulting T/NT mixture model, Fig. 17, resolves as a curve which
is asymptotic to NT in the lower tail and T in the upper tail. The
predicted values of V50 are slightly higher than the T values due to
the small additional contribution from NT. Fig. 16. XIMIS analyses of T classes (Classes 3, 4 & 5) for KORD, Chicago
O’Hare, IL.
Table 10 lists the estimates of the 50-year return gust speed, V50,
from each step in the simplified XIMIS analysis for all Development
stations. In general, missing valid values in the upper tail, y > 0, produce
4
SPC reports concurrent wind damage in nearby Irving. Some hours later, underestimates and surviving artefacts produce overestimates. The
tornados seen in nearby Burleson county. principal misclassification anomaly for the 6 largest discrepancies
14
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Table 10
Estimates of 50-year return gust speed by XIMIS for all Development stations.
V50 (kn) T = Classes 1 & 2 NT = Classes 3, 4 & 5 T/NT mixture
Kcode Visual TNNDT Error Visual TNNDT Error Visual TNNDT Error
KABQ 71.7 71.7 0 68.1 68.2 0.1 73.2 73.3 0.1
KBUF 70.4 69.1 − 1.3 67.8 67.8 0 72.3 71.5 − 0.8 Missing T rank 6
KBWI 59.5 59.6 0.1 57.9 57.9 0 61.1 61.2 0.1
KDAL 64.3 64.0 − 0.3 55.3 55.3 0 64.6 64.4 − 0.2
KDCA 64.8 63.5 − 1.3 59.4 59.4 0 65.5 64.5 − 1
KDEN 70.4 70.3 − 0.1 64.9 64.9 0 71.2 71.1 − 0.1
KDFW 75.9 74.6 − 1.3 56.7 56.7 0 75.9 74.6 − 1.3 Missing T rank 3
KGFK 71.5 72.0 0.5 62.3 62.8 0.5 71.8 72.4 0.6
KGTF 69.5 68.7 − 0.8 68.2 68.1 − 0.1 71.8 71.3 − 0.5
KIAD 60.8 65.1 4.3 62.1 62.4 0.3 64.2 66.9 2.7 Outlier T rank 1
KICT 78.4 77.6 − 0.8 63.4 63.8 0.4 78.4 77.7 − 0.7 Missing T rank 5
KILM 67.2 67.8 0.6 54.3 54.3 0 67.3 67.9 0.6
KJAN 62.8 62.2 − 0.6 61.0 61.0 0 65.5 65.4 − 0.1
KLAS 62.8 64.4 1.6 61.5 61.5 0 65.1 66.1 1
KLAX NA NA NA 46.6 46.6 0 46.6 46.6 0
KMDW 77.8 78.2 0.4 62.6 62.6 0 77.9 78.2 0.3
KORD 70.5 68.6 − 1.9 59.9 59.9 0 70.7 68.9 − 1.8 Missing T rank 3
KPNS 64.6 64.6 0 71.0 71.0 0 72.7 72.7 0
KPRB NA NA NA 52.9 52.9 0 52.9 52.9 0
KPWM 52.3 51.6 − 0.7 63.4 64.3 0.9 63.5 64.4 0.9 Outlier NT rank 1
KRFD 70.7 70.7 0 57.7 57.6 − 0.1 70.8 70.8 0
KSEA 50.8 50.8 0 60.0 59.9 − 0.1 60.2 60.1 − 0.1
KSUX 70.6 70.6 0 65.6 65.6 0 71.7 71.7 0
KTPA 50.4 50.4 0 63.2 63.2 0 63.3 63.3 0
KWMC 62.9 63.2 0.3 60.7 60.6 − 0.1 64.7 64.8 0.1
Uncorrected std error 1.22 0.23 0.85
Corrected ranks 1 to 6 std error 0.53 0.11 0.36
15
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
16
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Thunderstorm with apparently a lowly rank close to the annual mode, check events in the upper tail against independent sources.
however most of the higher ranks are artefacts. There is only one higher
valid event of 65 kn, so that the missing peak is likely to have been the 8.3.7. Spurious sustained high gust speeds
maximum in the record. The event in (b) is classed as a 76kn Front-up Occasionally, a sonic anemometer may malfunction and transmit
and is the highest valid value in the record which, interpolating the spurious high gust speeds sustained over an hour, so that the spurious
shape of the surviving trace, was quite possibly not exceeded in the cull. gust event classifies as Synoptic, as in Fig. 24(a). However, the traces of
gust and mean speed, viewed in (b) over a longer period, suggests the
8.3.6. Hail or freezing rain event is not of meteorological origin and this is confirmed by SPC. Valid
Fig. 23 shows two examples of apparent (a) relatively steady and (b) synoptic-scale events may often be confirmed by a neighbouring station,
transient high gust speed events occurring in periods of hail or freezing as in Fig. 24(c) and (d). In cases where there is no SPC report and the
rain, respectively. TNNDT classifies both (a) and (b) as Synoptic. wind speeds have been culled in the METAR record, events in the upper
XTNNDT classifies both (a) and (b) as transitional between Unclassifiable tail have been visually classified as Unclassifiable.
and Synoptic. On checking SPC both are confirmed as artefacts. Despite
searches, no information has been found on the performance of sonic
anemometers in hail and freezing rain. This again reinforces the need to
17
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
8.4. Geographic distribution of event frequency serial correlation, so that each synoptic-scale storm contributes
multiple events.
Fig. 25 presents maps of the geographical distribution of the annual
rate of gust events in each XTNNDT Class (XClass) for all gust events The remaining XClasses, 2 to 5, representing non-stationary events,
≥20kn to the end of 2021. are also strongly correlated with Station altitude, with some subtle
differences.
• For reference, (a) presents the elevation of each Station which
highlights the Great Colorado Plateau (GCP) west of the Rocky • Storm-burst events in (d), although defined as non-stationary in
Mountains and east of the Sierra Nevada and Cascades, as well as the terruptions in synoptic-scale storms, will include events transitional
Appalachians in the northeast. The elevation contours from (a) are between Synoptic and the two frontal classes, so are expected to
superimposed on the XClass annual rates in the other maps. inherit much of their character. The higher rate in the northeast is
• The rate of artefacts: 0 Unclassifiable and 6 Spike in (b) and (h) not shared with the frontal classes.
correlate strongly with station altitude which suggests that instru • Front-down, the least frequent XClass, is more frequent in Florida and
mentation reliability is the primary cause, in contrast with the NOAA less in the northeast.
preoccupation with perching birds (NOAA, 2013). • Front-up is, after GCP, more frequent in the southern States, espe
• Synoptic events in (c) are most frequent in a north-south band cially Texas and Florida.
through the High Plains – the very high annual rates being due to
18
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
• Thunderstorm is, after GCP, more frequent east of the Mississippi, instrumentation, QC and reporting procedures, as well as possible effects
particularly along the Appalachians and in Florida. of climate changes, during the 22-year observation period of the fre
quencies of occurrence. The observational period in Fig. 26 and Fig. 27 is
These observations are of frequency of occurrence and are not divided into three sub-periods: from 2000 to 2006 for the cup ane
directly related to value, i.e., gust strength, which requires additional mometers6; 2007–2013 for the sonic anemometers before QC Test 10;
processing to eliminate serial correlation before applying EVA. This is and 2014–2021 for sonic with QC Test 10.
the subject of a separate study.
19
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 23. Spurious gust speeds in (a) hail and (b) freezing rain.
20
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 24. Spurious (a, b) and valid (c, d) sustained high gust speeds.
• Class 3 is the least frequent and would be dominated by the four- (Mahalanobis, 1936), a generalised measure of separation, on the
times more frequent Class 4 if the frontal classes were merged. taxonomic dendrogram, Fig. 29.
In Fig. 29 the best separation is between the cold front events, 4:
Canonical Variate Analysis (CVA) has developed considerably from Front-up, and the remaining T/NT/A classes – achieving the principal
its inception (Hotelling, 1933). It is widely used in taxonomy to distin objective of this study. The second best is between T/NT and Artefact
guish between species, and in morphology for facial recognition. The and the third between T and NT. All these separations are in the upper
method uses correlation to define canonical variates which are linear quartile of the range of Mahalahobis distance. The final separations form
combinations of the original data that maximally separate classes. three pairs: the best between Classes 3 and 5, and the worst between
Applied7 to the checked TNNDT classes and timeseries, it produced the Classes 1 and 2. It follows that the only reasonable candidates for
CM of Table 14 with an error rate comparable with the discarded merging are (0,6) as Artefacts to be discarded and (1,2) as ABL gusts,
methods in Table 7. Although it uses the same data, its performance falls which is how they are presently treated in design practice.
short of TNNDT, but its usefulness here is in estimating the degree of
separation between Classes, in terms of the Mahalahobis distance
7
Using CVA in the R package “Morpho”.
21
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 25. Annual rate of occurrence of gust events ≥20kn in each Class.
22
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 27. Effect of instrumentation and quality control changes through the observation period.
23
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
Fig. 28. Ratio of the mean sonic and cup observations for peak gusts over thresholds, ensemble averaged for all analysis stations.
9. Conclusions
Table 13
Average annual rate of XNNDT Classes at Analysis stations.
• The ASOS 1-min interval data are compromised by several types of
Class 0 1 2 3 4 5 6 artefacts specific to this dataset, additional to those commonly found
Average annual rate 22.4 1119 27.0 3.4 11.6 27.4 10.1 in meteorological observations.
• A calibration sequence is occasionally included in the cup-
anemometer data at some stations.
Table 14 • The sonic anemometers produce spurious high gust values in hail and
Confusion matrix for CVA applied to upper tail (y > 0) of Analysis set. freezing rain.
• Visual classification using timeseries of gust speed, direction, dry
CVA Class
bulb temperature, atmospheric pressure and precipitation reliably
Class 0 1 2 3 4 5 6
distinguishes five Classes of valid gust events and two Classes of
0 542 174 30 31 20 11 510
1 15 4567 39 17 28 15 9
artefacts.
2 17 301 88 22 23 25 9 • Most isolated and multiple Spike artefacts are due to instrumentation,
3 4 43 4 158 0 63 2 acquisition, or transmission glitches, with bird-generated gusts
4 12 30 27 4 967 173 3 adding only a minor contribution.
5 127 210 64 85 160 2092 47
• Methods that rely on only anemometric data do not sufficiently
6 214 128 7 4 3 24 2850
Error: 19.5% accurately classify ASOS gusts for later EVA.
• Methods relying on statistical metrics of the meteorological param
eters, e.g., gust factor, perform less well than machine-learning
methods operating directly on the meteorological timeseries.
• A sensitivity EVA analysis shows that removing residual artefacts in
the far upper tail (y > 1) reduces errors in predicted 50-year return
speeds, V50, to less than ±0.5kn. Missing valid values in this range
produce underestimates of V50, while introduced artefacts produce
overestimates.
• The TNNDT method operating directly on the timeseries of three
parameters: gust speed, dry bulb temperature and atmospheric
pressure, achieves a classification error of 2.4% for gust speeds
greater than the annual mode.
• The TNNDT method identifies artefacts of very unreliable stations
effectively, rendering them as useful as the reliable stations.
• The TNNDT method training can be weighted to different ranges of
Fig. 29. CVA dendrogram of TNNDT Class separation for upper tail (y > 0) of
value: here TNNDT to all gusts >40kn and XTNNDT to gusts greater
Analysis set.
24
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
than the annual mode; the Class differences between the two administration, Resources, Software, Supervision, Validation, Visuali
weightings is 8.7%, with only 3.4% differing by more than one Class. zation, Writing – original draft, Writing – review & editing.
• It is proposed that gust speeds greater than the annual mode (y > 0)
should be checked for validity visually and against independent Funding statement
sources, e.g., SPC Storm Reports.
• There is strong correlation between rates of thermal events and of This is self-funded curiosity-driven research which did not receive
artefacts with station altitude, resulting in a geographical bias in any specific grant from funding agencies in the public, commercial, or
event frequency towards the Great Colorado Plateau. The Synoptic not-for-profit sectors.
Class is most frequent in the High Plains, longitudes − 105◦ to − 95◦ .
• Quality control Test 10, introduced in 2014, suppresses the reporting
of nearly all valid calms. It has little effect on the frequencies of valid Declaration of competing interest
events classified by TNNDT, but it culls the peak gust values. The
consequences of this require further investigation. The author declares that there are no known competing financial
• Estimates of the factor required to correct the peak gusts measured interests or personal relationships that could have appeared to influence
by the cup anemometer to the WMO standard 3s gust decreases the work reported in this paper.
almost linearly over the observed range, from 1.026 at 20kn to 1.006
at 50kn. Data availability
• The principal aim of distinguishing frontal events from T and NT
events has been achieved. Mendeley Data link provided. Further data on application.
• This present study and the earlier curation of the TD6405 database
(Cook, 2022) complete the data validation and classification steps Acknowledgements
required prior to render the ASOS 1-min interval data suitable for a
comprehensive study of extreme gust speeds across CONUS. The helpful and constructive comments and suggestions made by the
two reviewers of a previous draft paper, which was restricted in scope to
Credit author statement anemometric data only, and of the first draft of this paper are gratefully
acknowledged.The author also thanks Dr Brian Hempel, CIWRO, Uni
Nicholas J Cook: Conceptualization, Data curation, Formal analysis, versity of Oklahoma (https://ciwro.ou.edu/), for drawing attention to
Funding acquisition, Investigation, Methodology, Project the occasional calibration sequence illustrated in §8.3.2.
Supplementary information
The R scripts and instructions to extract gust events from any ASOS station and classify by the TNNDT method are available from Mendeley at URL:
https://doi.org/10.17632/88jp3swkn6.1. Rdata files of gust events ≥20kn at each the 450 ASOS stations of the Analysis set, classified by TNNDT and
XTNNDT, are also provided (118 Mb).
ICAO Code UTC (hours) Elevation (m) Sonic, date installed Latitude (degrees) Longitude (degrees) Station name
25
N.J. Cook Journal of Wind Engineering & Industrial Aerodynamics 234 (2023) 105330
References thunderstorm-like wind. J. Wind Eng. Ind. Aerod. 229, 105161 https://doi.org/
10.1016/j.jweia.2022.105161.
Lombardo, F.T., Zickar, A.S., 2019. Characteristics of measured extreme thunderstorm
Ahmed, M.R., El Damatty, A.A., Dai, K., Ibrahim, A., Lu, W., 2022. Parametric study of
near-surface wind gusts in the United States. J. Wind Eng. Ind. Aerod. 193, 103961
the quasi-static response of wind turbines in downburst conditions using a numerical
https://doi.org/10.1016/j.jweia.2019.103961.
model. Eng. Struct. 250, 113440 https://doi.org/10.1016/j.engstruct.2021.113440.
Lombardo, F.T., Main, J.A., Simiu, E., 2009. Automated extraction and classification of
Arul, M., Kareem, A., Burlando, M., Solari, G., 2022. Machine learning based automated
thunderstorm and non-thunderstorm wind data for extreme-value analysis. J. Wind
identification of thunderstorms from anemometric records using shapelet transform.
Eng. Ind. Aerod. 97, 120–131. https://doi.org/10.1016/j.jweia.2009.03.001.
J. Wind Eng. Ind. Aerod. 220, 104856 https://doi.org/10.1016/j.
Lombardo, F.T., Smith, D.A., Schroeder, D.L., Mehta, K.C., 2014. Thunderstorm
jweia.2021.104856.
characteristics of importance to wind engineering. J. Wind Eng. Ind. Aerod. 125,
Canepa, F., Burlando, M., Solari, G., 2020. Vertical profile characteristics of
121–132. https://doi.org/10.1016/j.jweia.2013.12.004.
thunderstorm outflows. J. Wind Eng. Ind. Aerod. 206, 104332 https://doi.org/
Lu, N.Y., Hawbecker, P., Basu, S., Manuel, L., 2019. On wind turbine loads during
10.1016/j.jweia.2020.104332.
thunderstorm downbursts in contrasting atmospheric stability regimes. Energies 12,
Castillo, E., 1988. Extreme Value Theory in Engineering. Academic Press, ISBN 0-12-
2773. https://doi.org/10.3390/en12142773.
163475-2, p. 389.
Mahalanobis, P.C., 1936. On the generalised distance in statistics. Proc. Natl. Inst. Sci.
Chen, G., Lombardo, F.T., 2020. An automated classification method of thunderstorm
India 2, 49–55. http://library.isical.ac.in:8080/xmlui/bitstream/handle/10263/67
and non-thunderstorm wind data based on a convolutional neural network. J. Wind
65/Vol02_1936_1_Art05-pcm.pdf.
Eng. Ind. Aerod. 207, 104407 https://doi.org/10.1016/j.jweia.2020.104407.
NCDC, 2006a. Data Documentation for Data Set 6405: ASOS Surface 1-minute, Page 1
Choi, E.C.C., 1999. Extreme wind characteristics over Singapore – an area in the
Data. July. National Climatic Data Center, Asheville NC, USA, p. 5. Available at: htt
equatorial belt. J. Wind Eng. Ind. Aerod. 83, 61–69. https://doi.org/10.1016/S0167-
ps://www.ncei.noaa.gov/pub/data/asos-onemin/td6405.txt.
6105(99)00061-6.
NCDC, 2006b. Data Documentation for Data Set 6406: ASOS Surface 1-minute, Page 2
Choi, E.C.C., 2004. Field measurement and experimental study of wind speed profile
Data. July. National Climatic Data Center, Asheville NC, USA, p. 5. Available at: htt
during thunderstorms. J. Wind Eng. Ind. Aerod. 92, 275–290. https://doi.org/
ps://www.ncei.noaa.gov/pub/data/asos-onemin/td6406.txt.
10.1016/j.jweia.2003.12.001.
NOAA, 2013. Primer for the ASOS Software Version 3.10 Ice Free Wind Sensor Quality
Choi, E.C.C., Hidayat, F.A., 2002. Gust factors for thunderstorm and non-thunderstorm
Control Algorithm. July 24. NOAA. Last accessed 13 November 2022: https://www.
winds. J. Wind Eng. Ind. Aerod. 90, 1683–1696. https://doi.org/10.1016/S0167-
weather.gov/media/asos/ASOS%20Implementation/IFWS%20QC%20Algorithm
6105(02)00279-9.
_primer.pdf.
Cook, N.J., 2014a. Review of errors in archived wind data. Weather 69, 72–81. https://
Orwig, K.D., Schroeder, J.D., 2007. Near-surface wind characteristics of extreme
doi.org/10.1002/wea.2148, 3.
thunderstorm outflows. J. Wind Eng. Ind. Aerod. 95, 565–584. https://doi.org/
Cook, N.J., 2014b. Detecting artefacts in analyses of extreme wind speeds. Wind Struct.
10.1016/j.jweia.2006.12.002.
19, 271–294. https://doi.org/10.12989/was.2014.19.3.271.
Samanta, S., Tyagi, B., Vissa, N.K., Sahu, R.K., 2020. A new thermodynamic index for
Cook, N.J., 2021. Locating the anemometers of the US ASOS network and classifying
thunderstorm detection based on cloud base height and equivalent potential
their local shelter. Weather wea 4131. https://doi.org/10.1002/wea.4131.
temperature. J. Atmos. Sol. Terr. Phys. 207, 105367 https://doi.org/10.1016/j.
Cook, N.J., 2022. Curating the TD6405 database of 1-minute interval wind observations
jastp.2020.105367.
across the USA for use in Wind Engineering studies. J. Wind Eng. Ind. Aerod. 224,
Shu, Z.R., Li, Q.S., He, Y.C., Chan, P.W., 2017. Vertical wind profiles for typhoon,
104961 https://doi.org/10.1016/j.jweia.2022.104961.
monsoon and thunderstorm winds. J. Wind Eng. Ind. Aerod. 168, 190–199. https://
De Gaetano, P., Repetto, M.P., Repetto, T., Solari, G., 2014. Separation and classification
doi.org/10.1016/j.jweia.2017.06.004.
of extreme wind events from anemometric records. J. Wind Eng. Ind. Aerod. 126,
Simiu, E., Heckert, N.A., 1996. Extreme wind distribution tails: a “Peaks over Threshold”
132–143. https://doi.org/10.1016/j.jweia.2014.01.006.
approach. J. Struct. Eng. ASCE 122, 539–547. https://doi.org/10.1061/(ASCE)0733-
Gomes, L., Vickery, B.J., 1978. Extreme wind speeds in mixed wind climates. J. Wind
9445(1996)122:5(539).
Eng. Ind. Aerod. 2, 331–344. https://doi.org/10.1016/0167-6105(78)90018-1.
Solari, G., Burlando, M., Repetto, M.P., 2020. Detection, simulation, modelling and
Gringorten II., 1963. A plotting rule for extreme probability paper. J. Geophys. Res. 68,
loading of thunderstorm outflows to design wind-safer and cost-efficient structures.
813–814.
J. Wind Eng. Ind. Aerod. 200, 104142 https://doi.org/10.1016/j.
Guerova, G., Dimitrova, T., Georgiev, S., 2019. Thunderstorm classification functions
jweia.2020.104142.
based on instability indices and GNSS IWV for the sofia plain. Rem. Sens. 11, 2988.
Twisdale, L.A., Vickery, P.J., 1992. Research on thunderstorm wind design parameters.
https://doi.org/10.3390/rs11242988.
J. Wind Eng. Ind. Aerod. 41, 545–556. https://doi.org/10.1016/0167-6105(92)
Gumbel, E.J., 1958. Statistics of Extremes. Columbia University Press, New York, ISBN 0-
90461-I.
231-02190-9, p. 371.
Vallis, M.B., Loredo-Souza, A.M., Ferriera, V., de Lima Nascimento, E., 2019.
Gunter, W.S., Schroeder, J.L., 2015. High-resolution full-scale measurements of
Classification and identification of synoptic and non-synoptic extreme wind events
thunderstorm outflow winds. J. Wind Eng. Ind. Aerod. 138, 13–26. https://doi.org/
from surface observations in South America. J. Wind Eng. Ind. Aerod. 193, 103963
10.1016/j.jweia.2014.12.005.
https://doi.org/10.1016/j.jweia.2019.103963.
Harris, R.I., 2009. XIMIS, a penultimate extreme value method suitable for all types of
Vickery, P.J., Masters, F.J., Powell, M.D., Wadhera, D., 2009. Hurricane hazard
wind climate. J. Wind Eng. Ind. Aerod. 97, 271–286. https://doi.org/10.1016/j.
modeling: the past, present, and future. J. Wind Eng. Ind. Aerod. 97, 392–405.
jweia.2009.06.011.
https://doi.org/10.1016/j.jweia.2009.05.005.
Hotelling, H., 1933. Analysis of a complex of statistical variables into principal
Xhelaj, A., Burlando, M., Solari, G., 2020. A general-purpose analytical model for
components. J. Educ. Psychol. 24, 417–441. https://doi.org/10.1037/h0071325.
reconstructing the thunderstorm outflows of travelling downbursts immersed in ABL
Huang, G., Jiang, Y., Peng, L., Solari, L., Liao, H., Li, M., 2019. Characteristics of intense
flows. J. Wind Eng. Ind. Aerod. 207, 104373 https://doi.org/10.1016/j.
winds in mountain area based on field measurement: focusing on thunderstorm
jweia.2020.104373.
winds. J. Wind Eng. Ind. Aerod. 190, 166–182. https://doi.org/10.1016/j.
Ye, L., Keogh, E., 2011. Time series shapelets: a novel technique that allows accurate,
jweia.2019.04.020.
interpretable and fast classification. Data Min. Knowl. Discov. 22, 149–182. https://
IAWE, 2011. Announcement of the Alan G. Davenport wind loading chain. J. Wind Eng.
doi.org/10.1007/s10618-010-0179-5.
Ind. Aerod. 99, 998–999. https://doi.org/10.1016/j.jweia.2011.09.005.
Zhang, Y., Sarkar, P.P., Hu, H., 2015. An experimental investigation on the
Kasperski, M., 2002. A new wind zone map of Germany. J. Wind Eng. Ind. Aerod. 90,
characteristics of fluid–structure interactions of a wind turbine model sited in
1271–1287. https://doi.org/10.1016/S0167-6105(02)00257-X.
microburst-like winds. J. Fluid Struct. 57, 206–218. https://doi.org/10.1016/j.
Kristensen, L., Frost Hansen, O., 2002. Distance constant of the Risø cup anemometer.
jfluidstructs.2015.06.016.
Forskingcenter Risoe. Riso-R No. 1320 (EN), 23. https://orbit.dtu.dk/en/publication
Zhang, S., Solari, G., Burlando, M., Yang, Q., 2019a. Directional decomposition and
s/distance-constant-of-the-ris%C3%B8-cup-anemometer.
properties of thunderstorm outflows. J. Wind Eng. Ind. Aerod. 189, 71–90. https://
Kwon, D.K., Kareem, A., Butler, K., 2012. Gust-front loading effects on wind turbine
doi.org/10.1016/j.jweia.2019.03.014.
tower systems. J. Wind Eng. Ind. Aerod. 104–106, 109–115. https://doi.org/
Zhang, S., Yang, Q., Solari, G., Li, B., Huang, G., 2019b. Characteristics of thunderstorm
10.1016/j.jweia.2012.03.030.
outflows in Beijing urban area. J. Wind Eng. Ind. Aerod. 195, 104011 https://doi.
Li, X., Li, S., Su, Y., Peng, L., Cao, S., Liu, M., 2022. Study on the time-varying extreme
org/10.1016/j.jweia.2019.104011.
value characteristic of the transient loads on a 5:1 rectangular cylinder subjected to a
26