Plain English: What This Paper Is Actually About
The Core Problem
Every rain gauge on Earth reads too low. Not randomly — systematically. The biggest reason by far is wind. When snow falls in wind, the air flowing over the gauge orifice deflects snowflakes away before they can fall in. At 6 m/s, some unshielded gauges miss more than half of all snowfall. You can't see this error in the recorded number — it just looks like less snow fell.
Smaller problems: water sticking to gauge walls that never gets counted (wetting loss), and water evaporating before observation (evaporation loss). Rain in wind is also affected, just far less severely than snow.
Why It Matters
Global precipitation datasets are built from these gauges. Hydrological models, Arctic water balance studies, river runoff forecasts — they're all working with numbers that are systematically too low. In snowy high-latitude regions, which cover roughly half the Northern Hemisphere land area, the bias is severe enough to fundamentally distort water budget calculations.
The WMO Intercomparison (1986–1993)
The WMO ran a massive field experiment using a special "ground truth" reference gauge called the Double Fence International Reference (DFIR) — a gauge protected by concentric wind-fence rings that catch nearly everything. Each country put its standard gauge next to a DFIR, measured thousands of snow events side by side, and developed equations describing how much each gauge type misses as wind increases. These equations (21 of them, covering the US, Russia, Norway, Japan, China, Finland, etc.) form the backbone of this paper.
What Adam & Lettenmaier Built
They took those equations and applied them to 5 years of daily weather station data (1994–1998) from 7,878 stations worldwide. For each station, each day: given the wind speed, temperature, and gauge type here, how much precipitation was missed? They summed these daily corrections into monthly factors, then gridded them globally at ½° resolution (~55 km). The result: 12 monthly maps showing how much to multiply any precipitation record by to correct for known biases.
The Three Correction Types
| Correction | Annual | DJF (Winter) | JJA (Summer) | Priority |
|---|---|---|---|---|
| Wind + Snow | +4.5% | +8.6% | +1.0% | ★★★ Dominant |
| Wind + Rain | +3.9% | +3.7% | +3.6% | ★★ Moderate |
| Wetting Losses | +2.8% | +2.9% | +2.7% | ★ Uniform |
| All Combined | +11.2% | +15.2% | +7.3% | — |
Honest Limitations Acknowledged
- Only 5 years of wind data available (1994–1998); corrections are climatological averages
- USA gets the lowest reliability score (30/70) due to gauge heterogeneity and airport wind bias
- Zero snow depth assumed — ignoring the effect of accumulated snow raising effective gauge height
- All US stations assumed unshielded — some have Alter shields, causing slight overcorrection there
- All anemometers assumed at 10m, fully exposed — not always true
- Cannot be used for climate trend detection — bias corrections are climatological, not year-specific
Key Numbers & Results
Comparison vs. Yang et al. Station Studies
Yang et al. performed station-specific corrections with full metadata (actual gauge heights, wind sensor heights, shielding status) for Alaska, Siberia, and Greenland. Adam's gridded estimates are consistently slightly higher (less corrected) because assumptions rather than exact metadata were used.
| Region | Stations | Mean Diff. | Std. Dev. | Key Driver |
|---|---|---|---|---|
| Siberia | 58 | +1.6% (Adam higher CR) | 4.5% | Liquid undercatch eq. difference |
| Greenland | 12 | +2.5% | 6.0% | Wind sensor height assumption |
| Alaska (all) | 9 | +3.5% | 12.0% | Shield status unknown — 2 Alter-shielded |
| Alaska (unshielded only) | 7 | +7.9% | 5.8% | Gauge/anemometer height assumptions |
Comparison vs. Legates & Willmott (1990)
Adam's dataset shows less warm-season precipitation than Legates & Willmott (evaporation loss not included here) but significantly more cold-season precipitation — because the WMO NWS 8" equation (Eq. 2-17) produces catch ratios up to 140% lower at 6.5 m/s than Legates & Willmott's pre-WMO equivalent.
Spatial Patterns
- Catch ratios generally increase north→south (less snow = less undercatch)
- Lower in Alps and Tibetan Plateau (high wind)
- Lower in US Midwest (high mean wind speeds)
- North America shows discontinuity at US/Canada border due to different measurement methods
- Southern South America anomaly (+20–80% liquid correction) — artifact of very few stations representing extreme winds
Modern Updates: WMO-SPICE & Post-2002
Adam & Lettenmaier (2003) is still widely cited and its equations remain the standard for correcting manual gauges. The WMO-SPICE experiment (2010–2015) extended the framework to modern automated gauges. These two frameworks are complementary, not competing — use Goodison et al. (1998) for NWS 8", Tretyakov, Nipher, Hellmann etc.; use SPICE/Kochendorfer for Geonor T-200B, OTT Pluvio2, and tipping-bucket automated gauges.
WMO-SPICE (2010–2015)
The WMO Solid Precipitation Intercomparison Experiment compared automated precipitation gauges across 8+ sites during the 2013/14 and 2014/15 winters. The new reference gauge is the Double Fence Automated Reference (DFAR).
Kochendorfer et al. (2017) Universal Transfer Function (UTF)
Two functional forms derived from 8 sites, applicable to single Alter-shielded and unshielded automated gauges:
CE = collection efficiency (equivalent to CR, 0–1) · U_gh = wind speed at gauge height (m/s) · T_air = air temperature (°C)
a, b, c = site-specific or universal coefficients from Table 1 of Kochendorfer et al. (2017b)
Requires user-determined precipitation phase. Coefficients differ by shield configuration and wind measurement height (gauge height vs. 10m).
Key SPICE Finding
Application of UTF reduced bias in unshielded gauges from −33.4% to +1.1% on average. However, less effective at the windiest sites, and some over-adjustment at calm sites — reinforcing that good wind shielding remains preferable to large software corrections.
Current Global Products and Their Methods
| Product | Correction Approach | Phase Determination | Notes |
|---|---|---|---|
| GPCP | Fixed monthly climatologies (Legates & Willmott basis) | Monthly empirical | Adam & Lettenmaier lineage |
| GPCC Monitoring | Dynamic (Fuchs-Schneider model, CF-F) | Event-based RH + T | Uses real-time GTS observations |
| ECCC Canada | SPICE UTF (Kochendorfer 2017b) | Temperature-continuous | 397 automated stations, 2001–2019 |
Ehsani & Behrangi (2022) found a ~4% difference in global land precipitation between GPCP and GPCC correction approaches. The community debate is ongoing.
What Remains Valid from Adam & Lettenmaier (2003)
| Element | Still Valid? | Notes |
|---|---|---|
| WMO 1998 catch ratio equations (Table 2-1) | ✓ Yes | Not superseded for manual gauge types |
| Log wind profile scaling (Eq. 3-4) | ✓ Yes | Standard boundary-layer similarity |
| Temperature-based phase partitioning | ~ Acceptable | Wet-bulb methods more accurate if available |
| Wetting loss values (Table 3-3) | ~ Use with caution | Bogdanova & Mestcherskaya suggest some values too high |
| Reference period 1979–1998 | ✗ Update needed | Use ERA5 (1940–present) for modern applications |
| Trend detection use | ✗ Not applicable | Authors explicitly prohibit this use case |
Core Bias Adjustment Model
Full Adjustment Equation (Adam & Lettenmaier Eq. 3-2)
| Symbol | Meaning | Units |
|---|---|---|
| Pa | Bias-adjusted precipitation | mm |
| Pg | Gauge-measured precipitation | mm |
| R | Solid precipitation fraction (0–1) | dimensionless |
| CRs | Catch ratio for solid precip — from WMO equations | 0–1 (or %) |
| κr | Correction factor for liquid undercatch (≥1) | dimensionless |
| ΔPwr | Wetting loss, liquid events | mm/day |
| ΔPws | Wetting loss, solid events (= 0.5 × liquid loss) | mm/day |
CR and κ are inverses. The WMO intercomparison uses catch ratio CR = gauge/truth (≤1 for undercatch). Legates (1987) used correction factor κ = truth/gauge (≥1). Adam & Lettenmaier use CR for solid, κ for liquid, following the conventions of their respective source literature. Don't mix them up.
Mean Monthly Aggregate Catch Ratio (Eq. 3-3)
Snow/Rain Partitioning
Daily precipitation is split into solid (snow) and liquid (rain) fractions using the US Army Corps of Engineers (1956) method based on daily temperature extremes.
| Condition | Precipitation Type | Solid Fraction R |
|---|---|---|
| T_min > +1.5°C | All liquid (rain) | 0.0 |
| T_max < −0.5°C | All solid (snow) | 1.0 |
| −0.5°C ≤ T_max and T_min ≤ +1.5°C | Mixed — interpolate | (1.5 − T_min) / 2.0 |
Yang et al. used −2°C and +2°C as thresholds instead of −0.5°C and +1.5°C. This methodological difference contributes to slight catch ratio divergence between the two approaches — Adam's estimates are consistently 1.6–7.9% higher than Yang's. For transitional climates (Pennsylvania spring/fall), threshold choice meaningfully affects how much precipitation gets the snow correction.
Pennsylvania Context
For central PA (MDT, CXY, LNS), solid precipitation occurs primarily November–March. The transitional period (October, April) is where partitioning assumptions matter most. At MDT, the documented TMIN warm bias means mixed events may be over-classified as liquid — reducing the solid undercatch correction applied to MDT relative to a truly unbiased temperature record.
Wind Profile Scaling to Gauge Height
The WMO catch ratio equations require wind speed at the gauge orifice, not at the standard anemometer height. The logarithmic similarity profile is used.
| Symbol | Definition | Value Used |
|---|---|---|
| w_h | Wind speed at gauge orifice height | Computed |
| w_H | Wind speed at anemometer | Observed (ASOS or archive) |
| h | Gauge orifice height above ground | Gauge-specific (see Table) |
| H | Anemometer height | Assumed 10 m (WMO standard) |
| z0 | Roughness length | 0.01 m (Oct–Mar) / 0.03 m (Apr–Sep) |
| m | Exposure coefficient | Assumed 1.0 (fully exposed) |
Snow Depth Effect on Effective Heights
Accumulated snow raises the effective ground level, reducing both h and H. This is ignored in this study (zero depth assumed). Sensitivity analysis shows:
| Snow Depth | CR Change (high wind) | Primary Driver |
|---|---|---|
| 0 m (assumed) | Baseline | — |
| 0.5 m | +~10% (less undercatch) | Gauge height reduction |
| 1.0 m | +~30% | Gauge height reduction (anemometer height change is negligible) |
NWS 8" gauge orifice height: 1.1 m. At heavy snow depths (>0.5 m), the effective orifice is much closer to the actual snow surface, and the wind speed at orifice becomes substantially higher per the log profile — increasing the correction. This is a known underestimation in the Adam & Lettenmaier approach for deep-snowpack environments.
WMO Solid Precipitation Catch Ratio Equations
Source: Goodison et al. (1998), Table 2-1. All equations give CR in %. Variables: w_h = gauge-height wind (m/s), T_max, T_min, T_mean in °C.
If computed gauge-height wind exceeds threshold, use threshold value. This prevents extrapolation beyond regression range and avoids overcorrection when snow may be blown INTO the gauge. Default: 6.5 m/s. Exceptions: North Korea and Austria → 6.0 m/s (regression behaves unrealistically beyond this).
Wind-Induced Liquid Precipitation Undercatch
The correction factor κ_r (≥1) multiplies gauge-measured liquid precipitation. All from Legates (1987), compiled from multiple sources. Variables: μ = transfer coefficient, w_hp = wind speed during precipitation at gauge height, w_p = at anemometer height.
Transfer Coefficient μ (Eq. 3-5)
Vapor Pressure Estimator (Eq. 3-6)
κ_r Equations by Gauge Type
Wind Speed During Precipitation (Bogdanova 1969, via Sevruk 1982)
Wetting Losses
Added for every day precipitation occurs. Reflects water adhering to gauge interior walls during and after precipitation. Solid precipitation wetting losses are half of liquid losses.
| Gauge | a (mm/day) | Source |
|---|---|---|
| NWS 8" | 0.15 | Legates (1987) |
| Australian | 0.02 | Legates (1987) |
| Wild | 0.20 | Legates (1987) |
| R.M.O. Mk 2 | 0.20 | Legates (1987) |
| Chinese/Japanese | 0.20* | *Interpolated |
| L'Association | 0.20 | Legates (1987) |
| Nipher | 0.25 | Legates (1987) |
| R.M.O. Mk 1 | 0.25 | Legates (1987) |
| South African | 0.25* | *Interpolated |
| 0/200 cm² generic | 0.25 | Legates (1987) |
| Kostlivi | 0.25* | *Interpolated |
| SMHI | 0.30 | Legates (1987) |
| Hellmann | 0.30 | Legates (1987) |
| Metra 886 | 0.30 | Legates (1987) |
| Tretyakov (full) | 0.30 | Legates (1987) |
| Tretyakov (Former USSR only) | 0.10 additional | FSU applies 0.20/0.10 operationally; only 0.10 more added |
Bogdanova & Mestcherskaya (1998) found that increasing measurement frequency in Former USSR (2×/day → 4×/day in 1966) did NOT increase daily wetting losses — on humid days containers don't fully dry between measurements. This suggests some historical wetting loss estimates may be too high. Groisman & Rankova (2001) confirmed. Use Table values as order-of-magnitude estimates; site-specific studies are preferable when available.
Country Gauge Parameters Table
Table 3-1: Parameters applied when computing daily CRs from WMO equations.
| Country/Region | Gauge Corrected For | Orifice Ht (m) | Wind Threshold (m/s) | Eq. # |
|---|---|---|---|---|
| Former USSR | Tretyakov (shielded) | 2.0 | 6.5 | 2-4 |
| USA | NWS 8" (unshielded) | 1.1 | 6.5 | 2-17 |
| Mongolia | Tretyakov (shielded) | 2.0 | 6.5 | 2-4 |
| Sweden | SMHI (shielded) | 1.5 | 6.5 | 2-3 |
| Norway | SMHI (shielded) | 1.5 | 6.5 | 2-3 |
| Greenland | Hellmann (shielded) | 3.0 | 6.5 | 2-9 |
| Iceland | Hellmann (shielded) [surrogate] | 2.0 | 6.5 | 2-9 |
| Japan | RT-4 (cylindrical shielded) | 3.5 | 6.5 | 2-13 |
| Finland | H&H-90 (Tretyakov shielded) | 1.5 | 6.5 | 2-20 |
| Poland | Hellmann (unshielded) | 1.5 | 6.5 | 2-7 |
| Romania | Hellmann (unshielded) [surrogate for IMC] | 1.5 | 6.5 | 2-7 |
| Switzerland | Hellmann (unshielded) | 1.5 | 6.5 | 2-7 |
| Czech./Slovakia | METRA 886 → then Tretyakov | 1.0 | 6.5 | 2-21 → 2-4 |
| North Korea | Tretyakov (unshielded) | 1.5 | 6.0 | 2-4 |
| China | Chinese (unshielded) | 0.7 | 6.5 | 2-14 |
| South Korea | Chinese (unshielded) | 0.2 | 6.5 | 2-14 |
| Austria | Wild (shielded) [surrogate] | 1.0 | 6.0 | 2-19 |
| Canada | Handled separately — see Canada section | — | ||
Country Reliability Scores
Scoring system on a 70-point scale assessing accuracy of the wind-induced solid precipitation undercatch adjustment. Points: Gauge Representation (0–20) + Equation Application (0–30) + Interpolation/Station Density (0–20).
Score Criteria
| Category | Max Points | Criteria |
|---|---|---|
| Gauge Representation | 20 | How well does assumed national gauge represent actual network? |
| Equation Application | 30 | −5 if N<100; −5 if r²<0.70; −5 if equation is a surrogate |
| Station Density | 20 | +10 if <5,000 km²/gauge; +10 more if <2,500 km²/gauge |
| Country | Density (km²/gauge) | Gauge Rep. | Eq. Appl. | Interpolation | Total / 70 |
|---|---|---|---|---|---|
| Canada | 1,491 | 20 | 25 | 15 | |
| Sweden | 1,601 | 15 | 25 | 20 | |
| South Korea | 1,824 | 20 | 20 | 20 | |
| Norway | 2,014 | 10 | 25 | 20 | |
| Poland | 4,667 | 15 | 30 | 10 | |
| Romania | 1,389 | 15 | 20 | 20 | |
| North Korea | 4,305 | 20 | 25 | 10 | |
| Switzerland | 1,007 | 5 | 30 | 20 | |
| Former USSR | 13,791 | 20 | 25 | 0 | |
| Mongolia | 30,686 | 20 | 25 | 0 | |
| Japan | 1,749 | 5 | 20 | 20 | |
| China | 12,529 | 20 | 20 | 0 | |
| Former Czech./Slovakia | 2,241 | 10 | 10 | 20 | |
| Greenland | 10,095 | 15 | 20 | 0 | |
| Iceland | 2,711 | 15 | 10 | 10 | |
| Austria | 655 | 5 | 10 | 20 | |
| USA ← LOWEST | 9,816 | 5 | 25 | 0 | |
| Finland | 6,241 | 10 | 20 | 0 |
Canada: Special Case
Canada uses a snow ruler (not a gauge) to measure fresh snowfall at ~1,800 stations, converting depth to SWE using an assumed density of 100 kg/m³. A shielded Nipher gauge is used at only ~125 stations. This means the standard catch ratio framework doesn't apply — Canadian "catch ratios" are actually adjustment ratios (archived measurement over ground truth), not traditional gauge-to-DFIR catch ratios.
Two Source Datasets Blended
| Feature | Groisman (1998b) | Mekis & Hogg (1999) |
|---|---|---|
| Stations | 6,692 | 495 (best quality) |
| Timestep | Monthly | Daily |
| Snow correction | Climatological density + CR=0.90 assumed | WMO equations + observed wind |
| Trace correction | None | 0.1 mm per trace event |
| Accuracy | Lower; wider network | Higher; gold standard |
Blending Approach
- Find 485 stations common to both datasets
- Compute monthly ratio: Groisman/Mekis for 1979–1990
- Grid ratios to ½° using SYMAP
- Scale full Groisman network to Mekis quality using gridded ratios
- Derive catch ratios: CR = unadjusted / scaled-Groisman
Canadian catch ratios can exceed 100% in the Rocky Mountains. This isn't an error — it means the snow ruler was overestimating SWE because fresh mountain snow is less dense than the assumed 100 kg/m³. The bias correction actually reduces the archived value in these regions.
Catch Ratio Calculator
Compute the solid precipitation catch ratio for a given gauge type and wind conditions, following the WMO equations. Results are in %.
Input Parameters
Results
CR vs. Wind Speed Curves
Interactive visualization of catch ratio degradation with increasing wind speed for major gauge types. Wind speed shown at gauge height. Assumes pure snow conditions.
Symbol Reference
| Symbol | Meaning | Units | Equation(s) |
|---|---|---|---|
| Pa | Bias-adjusted precipitation | mm | 3-1, 3-2 |
| Pg | Gauge-measured precipitation | mm | 3-1, 3-2, 3-3 |
| R | Solid precipitation fraction | 0–1 | 3-1, 3-2 |
| CR | Catch ratio (gauge/truth) | % or 0–1 | 2-1 through 2-21 |
| CRs | Catch ratio for solid precipitation | fraction | 3-2 |
| CR_all | Aggregate mean monthly catch ratio | fraction | 3-3 |
| κ | Correction factor (truth/gauge) = 1/CR | ≥1 | 3-1 |
| κr | Correction factor, liquid precipitation | ≥1 | 3-7 through 3-15 |
| w_h | Wind speed at gauge orifice height | m/s | 3-4, 2-1–2-21 |
| w_H | Wind speed at anemometer height | m/s | 3-4 |
| w_p | Wind during precipitation (anem. height) | m/s | 3-15, 3-16 |
| w_hp | Wind during precipitation (gauge height) | m/s | 3-7–3-14 |
| h | Gauge orifice height above ground | m | 3-4 |
| H | Anemometer height (assumed 10 m) | m | 3-4 |
| z0 | Roughness length (0.01/0.03 m) | m | 3-4 |
| m | Exposure coefficient (assumed 1.0) | — | 3-4 |
| T_max | Daily maximum temperature | °C | 2-2, 2-4, 2-5 |
| T_min | Daily minimum temperature | °C | 2-2, 2-5 |
| T_mean | Daily mean temperature | °C | 2-8 |
| T_a | Monthly mean air temperature | °C | 3-5, 3-6 |
| μ | Transfer coefficient (thermodynamic) | — | 3-5, 3-7–3-12 |
| e_a | Monthly mean vapor pressure | kPa | 3-6 |
| ΔPwr | Wetting losses, liquid | mm | 3-2 |
| ΔPws | Wetting losses, solid (= 0.5·a) | mm | 3-2 |
| a | Gauge wetting loss per event | mm/day | Table 3-3 |
| Lr | Empirical coeff., wind during precip | — | 3-16, 3-17 |
| M | Precipitation days per month | days | 3-17 |
| N | Observations in WMO regression | count | Table 2-1 |
| r² | Coefficient of determination | 0–1 | Table 2-1 |
| DFIR | Double Fence International Reference | — | WMO reference standard |
| DFAR | Double Fence Automated Reference | — | WMO-SPICE reference standard |
| UTF | Universal Transfer Function (SPICE) | — | Kochendorfer et al. (2017) |
NEXRAD Beam Geometry
Surface precipitation gauges and NEXRAD radar measure precipitation in completely different volumes of the atmosphere. Understanding where the radar beam is relative to the precipitating cloud is essential for interpreting any gauge–radar comparison, and directly affects how bias corrections from Adam & Lettenmaier should be applied alongside radar-derived fields.
The Standard Beam Height Equation
The height of the center of the radar beam above ground at range r is:
| Symbol | Meaning | Value |
|---|---|---|
| H(r) | Beam center height above ground at range r | km |
| r | Slant range from radar | km |
| Re | True Earth radius | 6,371 km |
| ke | Effective Earth radius factor (standard atmosphere) | 4/3 ≈ 1.333 |
| θ | Elevation angle | radians |
| h_s | Radar site elevation above MSL | km |
Beam Width
Beam Bottom and Top
Key Physical Consequences
| Range from Radar | Approx. Beam Center (0.5° tilt) | Consequence |
|---|---|---|
| 25 km | ~0.5 km AGL | Beam is in rain layer — good QPE |
| 50 km | ~1.0 km AGL | Still in rain for most events |
| 100 km | ~2.0 km AGL | Near or above bright band in weak events |
| 150 km | ~3.3 km AGL | Frequently above melting layer in winter |
| 200 km | ~5.0 km AGL | Almost always overshooting in PA winter events |
| 250 km+ | >7 km AGL | Severe beam overshoot — QPE unreliable |
Gauge undercatch (corrected by Adam & Lettenmaier methods) and radar beam overshoot produce opposite biases in the same direction — both cause the apparent surface precipitation to be lower than actual. When comparing gauge records to NEXRAD QPE at long ranges, the radar may itself be underestimating, masking or compounding the gauge undercatch signal. Never apply gauge bias corrections to improve gauge-radar agreement without first verifying the beam is actually sampling the precipitating layer.
Beam Zone Classification
Precipitation observations can be classified by which portion of the radar scanning geometry they fall in. This determines how much confidence to place in radar QPE for a given station, and whether gauge undercatch corrections are the dominant error source or whether beam geometry effects dominate.
Four-Zone Framework
| Zone | Criteria | Radar Quality | Dominant Bias Source |
|---|---|---|---|
| Zone 1: Beam Below ML | Beam center < melting level; beam bottom < ML − 500m | High — sampling liquid precipitation directly | ZR relationship error; gauge undercatch |
| Zone 2: Bright Band | Beam intersects melting layer (ML ± ~500m) | Degraded — bright band enhancement inflates Z | Bright band contamination; mixed-phase ZR |
| Zone 3: Partial Overshoot | Beam bottom above ML; beam still sampling precipitation | Poor — ice crystal sampling, Z-to-snow relationship | Ice-phase ZS relationship; beam overshoot |
| Zone 4: Complete Overshoot | Beam entirely above precipitation top | None — no signal or noise floor only | Radar cannot detect event at all |
Melting Layer Height Estimation
Seasonal Patterns for Pennsylvania
| Season | Typical ML Height (AGL) | Zone 1 Radius (0.5° tilt) | Notes |
|---|---|---|---|
| Summer (JJA) | 3.5–5.0 km | ~200–250 km | Beam usually in rain layer for all PA coverage |
| Spring/Fall | 1.5–3.5 km | ~100–175 km | Transitional; check soundings case-by-case |
| Winter (DJF) | 0.3–1.5 km | ~25–75 km | Beam overshoot dominates beyond ~75 km |
| Cold precip events | <0.5 km (surface) | ~0–25 km | NEXRAD nearly useless for QPE; gauge is only truth |
In Pennsylvania winter events, NEXRAD stations (KCCX, KPBZ, KDOX, KBGM, etc.) are frequently operating in Zone 3 or 4 for large fractions of their coverage area. The gauge record — even with its wind-induced undercatch — is often the more accurate of the two measurements. This is why bias-corrected gauge networks (CoCoRaHS, COOP, ASOS) remain the authoritative truth for winter precipitation QPE verification in PA.
Five-Point Spatial Integration Method
A single beam height calculation at a station's exact coordinates is insufficient for representing the spatial variability of radar beam geometry across a ½° grid cell. The five-point integration method samples the beam geometry at five locations within a grid cell and averages the result, providing a spatially representative beam height estimate consistent with the resolution of the bias correction grids.
Sampling Pattern
Why Five Points Instead of One
| Scenario | Single-Point Error | Five-Point Error |
|---|---|---|
| Near-range, flat terrain | <50 m | <20 m |
| Mid-range, ridge-valley | 200–800 m | 100–300 m |
| Far-range, complex terrain | 500–2000 m | 200–600 m |
| Transition zone (beam near ML) | May misclassify zone | Better zone boundary estimate |
Terrain Correction
In mountainous or ridge-valley terrain (Appalachians, Ridge & Valley province), ground elevation varies significantly within a single ½° grid cell. The beam height above ground at each of the five points must account for local terrain elevation:
Application to PA NEXRAD Sites
| NEXRAD Site | Elev. (m MSL) | Location | Key Coverage Challenge |
|---|---|---|---|
| KCCX | 733 | State College (Bald Eagle Mtn) | High elevation; near-range beam very low — good winter coverage |
| KPBZ | 361 | Pittsburgh | Western PA; good coverage but ridge blockage to east |
| KDOX | 15 | Dover DE (coastal) | Low elevation; overshoot over central/western PA |
| KBGM | 490 | Binghamton NY | Northern PA coverage; moderate terrain effects |
| KBUF | 211 | Buffalo NY | NW PA; lake-effect snow geometry issues |
| KAKQ | 34 | Norfolk VA | SE PA only; severe overshoot over interior |
| KENX | 557 | Albany NY | NE PA fringe; complex terrain |
| KRLX | 381 | Charleston WV | SW PA tip only; limited impact |
Pennsylvania Station Network Context
Pennsylvania's geography creates systematic spatial gradients in both gauge undercatch and radar beam quality. Understanding these gradients is essential for interpreting any PA precipitation climatology.
Physiographic Provinces and Precipitation Bias
| Province | Key Stations | Undercatch Risk | Radar Quality (Winter) | Notes |
|---|---|---|---|---|
| Ridge & Valley | LNS, SEG, IPT | Moderate–High (wind channeling) | Variable (terrain blockage) | Valley stations miss upslope snow; ridge gauges experience high winds |
| Allegheny Plateau | JST, DUJ, IPT | High (elevation + wind) | Poor (far from KCCX at high elevation) | Heaviest snowfall region; highest undercatch corrections expected |
| Piedmont / SE | PHL, ABE, RDG | Low (mild, less snow) | Good (near KDOX/KPBZ range) | Least affected by solid undercatch |
| Susquehanna Valley | MDT, CXY, THV | Moderate (mixed precip season) | Moderate–good (KCCX coverage) | River moisture enhancement real (MDT vs CXY ~11% bias confirmed physical) |
| Lake Erie Corridor | ERI, BFD | Very High (lake-effect snow) | Poor (lake-effect shallow, high wind) | Lake-effect snow is notoriously difficult for both gauge and radar |
MDT vs. CXY — Gauge Bias Decomposition
The documented ~11% mean annual precipitation bias between Harrisburg International (MDT) and Capital City Airport (CXY) illustrates how multiple bias sources stack:
| Bias Source | Direction at MDT | Magnitude (estimated) | Mechanism |
|---|---|---|---|
| Susquehanna River moisture enhancement | ↑ Real precipitation higher at MDT | ~4–6% | River evaporation augments low-level moisture convergence in SW flow |
| Valley convergence | ↑ Real precipitation higher at MDT | ~2–4% | Susquehanna Valley funnels low-level flow toward MDT on synoptic days |
| Frozen precipitation measurement | ↑ MDT catches more SWE (Alter shield) | ~1–3% | MDT has Alter-shielded gauge; CXY unshielded → CXY misses more snow |
| TMIN warm bias at MDT | ↓ Reduces solid fraction estimate | <1% | Airport heat island slightly suppresses solid classification for MDT |
| Net observed bias | MDT > CXY | ~11% annually | Physical, not artifactual — confirmed by spatial analysis |
The MDT/CXY bias is larger than can be explained by wind-induced gauge undercatch differences alone. The Adam & Lettenmaier framework would predict only 1–3% gauge catch difference between two nearby ASOS stations with similar wind exposure. The additional 8–10% is real physical precipitation difference driven by mesoscale geography — exactly the kind of signal that spatially averaged gridded bias corrections cannot resolve at ½° resolution.
PAPRISM500 Implications
The 500-point Pennsylvania climatological reconstruction presents a spatial interpolation problem where gauge undercatch corrections interact with the PRISM topographic regression. Key considerations:
- PRISM already partially corrects for elevation-precipitation relationships — but it uses raw gauge data as its training input, which includes undercatch bias. PRISM precipitation in high-wind ridgeline areas may be systematically low.
- ERA5's ~27 km resolution smooths out the valley-ridge precipitation gradient that the five-point beam integration is designed to capture.
- Monthly bias corrections applied after PRISM interpolation are preferable to correcting at the station level before interpolation — the corrections are already on a gridded climatological basis from the Adam & Lettenmaier framework.
- Cold-season months (Nov–Mar) warrant separate treatment given the nonlinear interaction between wind field, solid fraction, and PRISM's regression weighting of high-elevation gauges.
Gauge vs. Radar Bias: Integrated View
When evaluating any precipitation dataset for Pennsylvania, both gauge undercatch (corrected by Adam & Lettenmaier) and radar beam geometry effects (quantified by NEXRAD beam analysis) must be understood simultaneously. They interact in non-obvious ways.
Error Direction Matrix
| Condition | Gauge Error | Radar Error | Net Comparison |
|---|---|---|---|
| Warm-season rain, near-range | Gauge low (~3–5% liquid undercatch) | Radar accurate (Zone 1) | Radar > gauge; gauge correction closes gap |
| Cold-season snow, near-range | Gauge very low (10–50%+ solid undercatch) | Radar moderate (Zone 1-2) | Radar >> gauge; large correction needed |
| Cold-season snow, far-range | Gauge very low (10–50%+ solid undercatch) | Radar also very low (Zone 3-4 overshoot) | Both low; radar-gauge agreement is illusory |
| Warm-season convection, far-range | Gauge near-accurate (warm, low wind) | Radar undersamples cell tops | Gauge > radar; spatial sampling mismatch |
| Transition season mixed precip | Gauge partially biased | Bright band contamination | Both uncertain; ZR algorithm worst here |
The Illusory Agreement Problem
In winter, at long ranges (>150 km from radar), gauge undercatch and radar beam overshoot both push their respective measurements low. This means gauge–radar difference plots may show apparent good agreement while both instruments are simultaneously and substantially underestimating true surface precipitation. A low bias difference does not mean either instrument is accurate — it may mean both are wrong by similar amounts in the same direction. This is the most dangerous error mode in winter QPE verification.
Recommended Analysis Protocol
- Classify each gauge–radar comparison point by beam zone (using 5-point integration at appropriate elevation angle)
- Apply Adam & Lettenmaier CR corrections to gauge record first
- Apply NWS MLQPE or MRMS beam-block corrections to radar field
- Stratify gauge–radar comparisons by beam zone — only Zone 1 comparisons are diagnostic of gauge undercatch alone
- Zone 2 comparisons require bright-band correction to radar before gauge comparison
- Zones 3 and 4 comparisons should be excluded from gauge bias validation entirely
- For PAPRISM500 / historical reconstructions: where NEXRAD data predates 1994 (pre-WSR-88D), there is no independent radar truth — gauge corrections are the only available correction pathway
IEM ASOS Pipeline Notes (PA-Specific)
| Data Source | Precip Type Classification | Wind Data Available | Use for CR Correction? |
|---|---|---|---|
| ASOS 1-minute | Present weather sensor (METAR) | Yes — 2-min average at 10m | Yes — best available daily input |
| IEM COOP Daily | Observer-reported (manual) | No coincident wind | Use climatological wind from nearby ASOS |
| CoCoRaHS Daily | SWE only, no phase obs | No wind | Phase from temperature; wind from gridded reanalysis |
| ERA5 reanalysis | Phase from T2m / Td2m | 10m wind at ~27 km res | Modern wind proxy for historical corrections |
| BUFKIT RAP soundings (IEM) | Full sounding — best phase determination | Wind profile to determine z_0 layer | Gold standard for event-based analysis |
Dunkerley (2023): Recording Rainfall Intensity — Has an Optimum Method Been Found?
Dunkerley, D. (2023). Water, 15, 3383. https://doi.org/10.3390/w15193383 · Open Access CC BY 4.0 · Monash University
This is a landmark review paper cataloguing every known method for recording rainfall intensity — 15 categories spanning point-based gauges to satellite systems, from the 17th century through 2023. Its central finding is blunt: no standard or optimum method has emerged after 80+ years of systematic effort. New approaches are still actively being explored, and the widely-used tipping-bucket gauge is, in Dunkerley's assessment, poorly suited to the very thing it is most often used to measure.
The Ground Truth Problem
A central theme throughout the paper is what Dunkerley calls the "ground truth problem": identifying what rainfall intensity actually is at a point is itself deeply non-trivial. Radar and microwave link measurements need ground calibration — but the ground-level gauges used for calibration all have their own biases. There is no universally accepted reference for intensity the way the DFIR serves as a reference for total accumulation in the WMO solid precipitation intercomparisons.
Why Intensity Is Hard
Intensity is not the same as accumulation. It requires time-resolved measurement. The fundamental challenge is that rainfall intensity fluctuates continuously, sometimes by ~800 mm/h per minute during convective bursts. Hourly totals hide all of this. Even 1-minute TBRG data misses the structure of intensity fluctuations because tip events are not synchronized to clock time. Sub-minute resolution — ideally seconds — is required to faithfully capture intensity, yet almost no routine networks achieve this.
The 15 Method Categories
| # | Category | Type | Best Intensity Resolution | In Common Use? |
|---|---|---|---|---|
| 1 | Historical / obsolete (Jardí, timed-entry) | Point | ~15 s | No |
| 2 | Tipping-bucket (TBRG) | Point | ~1 min (unreliable) | Yes — dominant globally |
| 3 | Drop-forming & counting | Point | ~6–15 s | Limited research use |
| 4 | Disdrometers (impact, optical) | Point | 1 min | Yes — research networks |
| 5 | Weighing gauges (Geonor, Pluvio2) | Point | 1 min (≥7–12 mm/h threshold) | Yes — automated networks |
| 6 | Acoustic gauges | Point | 1 s or finer | Research only |
| 7 | Optical / camera / video | Point / path | ~2 s | Research / emerging |
| 8 | Thermal (hotplate) | Point | 1 min | Niche (snow measurement) |
| 9 | Weighing lysimeters | Point / area | 10 min | Research sites only |
| 10 | Other electro-mechanical (piezo, capacitive) | Point | Seconds | Research only |
| 11 | Radiation / nuclear (gamma flux) | Point | 1 min | Experimental |
| 12 | Radar (NEXRAD, micro-rain, X-band) | Area | ~5 min, km² resolution | Yes — operational |
| 13 | Microwave attenuation (CML / cellular) | Path / area | Minutes, path-averaged | Emerging operational |
| 14 | Seismic methods | Area | ~6 min | Research only |
| 15 | Miscellaneous (vehicles, wipers, phones) | Mobile / crowdsource | Seconds–minutes | Emerging / experimental |
Key Cross-Cutting Finding: The TBRG Is Used as Ground Truth But Shouldn't Be
The tipping-bucket rain gauge is the global standard used to validate disdrometers, radar, microwave links, and almost every other measurement method. Yet TBRGs are poorly suited to recording intensity — they cannot resolve rainfall duration accurately, they miss rain during tipping, they are biased at high rates, and they cannot detect drizzle or light rain that falls between tips. Dunkerley argues a better standard reference is genuinely needed before the field can make confident progress on intensity measurement.
Tipping-Bucket Rain Gauges (TBRG)
The most widespread recording gauge in the world. The see-saw mechanism records rainfall in fixed increments — typically 0.2 mm, less often 0.1 mm or 0.5 mm. The time of each tip is logged, and intensity is estimated from inter-tip intervals.
How Intensity Is Derived
Fundamental Error Sources
| Error Source | Direction | Magnitude | Notes |
|---|---|---|---|
| High-rate undercatch (tip during fill) | ↓ Underestimate | >10% at 100 mm/h; larger at higher rates | Rain arriving during the finite tip time allocated to already-full bucket |
| Intensity-dependent bucket capacity | ↓ Underestimate | Varies by design; needs dynamic calibration | Effective bucket capacity is not fixed — changes with rain rate |
| Partially filled bucket at rain end | ↓ Underestimate duration | Minutes to hours | Last real rain may occur well before final tip; TBRG cannot detect rain end |
| Non-clock-synchronised tip events | ↔ Timing ambiguity | Large in light rain | Tip events not tied to clock minutes; "1-min rain rates" from TBRGs are unreliable |
| Funnel wetting lag | ↓ Delays start detection | Minutes | Funnel must wet up before delivering water to buckets; first drops may not be counted |
| Wind undercatch | ↓ Underestimate | 2–50%+ depending on conditions | Standard gauge aerodynamics — addressed by Adam & Lettenmaier framework |
| Evaporative loss from funnel | ↓ Underestimate | Small but real | ~1346 cm² inclined funnel surface vs. ~700 cm² orifice area available for evaporation |
The Intermittency Problem
Rainfall routinely starts and stops multiple times per hour — intra-event intermittency. If a TBRG is used to estimate intensity during a 1-hour period containing 20 minutes of actual rain, dividing total mm by 60 minutes gives a mean rate of one-third the actual intensity. Even if you use RR = V/(T2−T1), you cannot identify the intermittent gaps — the gauge simply appears to tip slowly, indistinguishable from continuous light rain.
Dunkerley's acoustic field data showed a 1h 46min event delivering 5.6 mm at an average rate of 3.2 mm/h produced 924 acoustic voltage readings — but only 28 TBRG bucket tips. The TBRG record had no visibility into the rapid intensity fluctuations visible in the acoustic record.
Modifications and Improvements
| Modification | Developer | Improvement | Status |
|---|---|---|---|
| Weighing TBRG (bucket weight during fill) | Lee (2004), Kim & Lee (2004) | Eliminates tip-time intensity loss; 0.01 mm resolution | Commercial (Lambrecht "Rain[e]") — limited deployment |
| Dual-bucket design | Choi et al. (2022) | Upper 0.1 mm + lower 0.5 mm catches splash; take max of two | Research |
| Rotating disc (12 chambers) | Mink & Forrest (1976) | 0.005 mm resolution — 40× standard; stainless steel disc | Not in common use; requires tight manufacturing tolerances |
| Solenoid-valve dual-chamber | Drabbe (1975) | 0.1 mm resolution; automatic emptying | Not in common use |
| Triboelectric self-powered | Hu et al. (2022) | Float-lift nanogenerator — generates power from rainfall | Experimental |
| Dynamic calibration | Sypka (2019), Duchon & Biddle (2010) | Treats bucket capacity as variable function of intensity | Best current practice for high-rate events |
Syphon gauges (Dines tilting-syphon, R.M. Williams self-siphon) self-empty less frequently than TBRGs. The R.M. Williams gauge triggers at 50 mm accumulated. Syphon emptying takes ~30 s during which rainfall is not recorded — same fundamental deficiency as TBRGs during tip events. Widely used at sea (buoys, ships). Small collecting area (100 cm²) limits accuracy for large drops.
Weighing Rain Gauges
Weighing gauges continuously measure the accumulated mass of collected water. Unlike TBRGs, they can in principle record any intensity from the moment rainfall starts. The two dominant commercial designs are the Geonor T-200B (vibrating wire) and the OTT Pluvio2 (load cell).
Operating Principles
| Gauge | Sensing Method | Collecting Area | Resolution | Max Capacity | Min Detectable Rate |
|---|---|---|---|---|---|
| Geonor T-200B | 3 vibrating-wire sensors; resonant freq. ∝ load | 200 cm² | 0.05 mm (600 mm cap) / 0.1 mm (1500 mm cap) | 600–1500 mm | Not specified by Dunkerley; ~0.1 mm/h practical |
| OTT Pluvio2 | Load cell; DC voltage ∝ weight | 200 or 400 cm² | 0.001 mm/h (stated) | 750 mm (400cm²) / 1500 mm (200cm²) | ≥7–12 mm/h for reliable 1-min data (Saha et al.) |
| Lambrecht "Rain[e]" | Weighing + tipping hybrid | 200 cm² | 0.001 mm/h | — | Up to 1200 mm/h operational range |
Error Sources for Weighing Gauges
| Error Source | Effect | Notes |
|---|---|---|
| Temperature fluctuation | Spurious rainfall recorded | Thermal expansion/contraction of bucket creates apparent weight change (Knecht et al. 2019) |
| Wind-induced vibration | Noisy signal | Wind rock on gauge creates apparent weight changes; post-processing required (Nayak et al. 2008; Ross et al. 2020) |
| Low-intensity floor | Misses light rain | Below ~7 mm/h, 1-min weighing resolution insufficient to reliably detect each minute's accumulation |
| Oil/antifreeze contamination | Potential calibration drift | Oil added to reduce evaporation; antifreeze in winter — both affect density and weighing accuracy |
| Wind undercatch (solid) | Underestimate snow | Same problem as all catching gauges — addressed by SPICE/Kochendorfer transfer functions (see Modern Updates section) |
| Manual emptying required | Operational constraint | Buckets must be emptied before reaching capacity; at remote sites this limits deployment duration |
WMO-SPICE Connection
The Geonor T-200B and OTT Pluvio2 are the two gauges used in the WMO-SPICE experiments (Kochendorfer et al. 2017). The Universal Transfer Function (UTF) derived from SPICE was specifically developed for these weighing gauge / shield configurations. See the Modern Updates section for the UTF equations.
Weighing lysimeters (isolated soil monoliths on load cells) offer a related approach with zero wind-undercatch — the large surface area at ground level means drops arrive at natural trajectories. Gebler et al. (2015) found lysimeters recorded 16.4% more precipitation than a co-located TBRG at 1 m height, largely due to elimination of wind undercatch and ability to record dew. Lysimeters are research tools but represent a useful upper-bound estimate of true precipitation arrival.
Drop-Forming & Counting Gauges
Drop-counting gauges collect rainfall through a standard funnel, then deliver it to a narrow capillary tube (~3 mm inside diameter) that forms individual drops of known volume. Each drop is counted as it falls — optically (infrared beam) or electrically (contact with two fine electrodes). The key advantage: no waiting for a bucket to fill.
Performance Characteristics
| Design | Drop Volume | Equivalent Sensitivity | Response Time | Upper Limit |
|---|---|---|---|---|
| Norbury & White (1971) | ~0.07 mL | ~0.006 mm (33× finer than 0.2mm TBRG) | 10–15 s | >100 mm/h (trickle onset) |
| Stow & Dirks (1998) | ~3.0 mm diameter drop | ~0.006 mm | ~6 s | >50 mm/h (trickle onset) |
| Sansom & Gray (2002) "RIG" | Standard capillary | Detects 0.1 mm/h with 1–2 min lag | ~6 s | >100 mm/h |
Key Problems
- Trickle phenomenon: At intensities above ~50–100 mm/h, the capillary tube produces a continuous stream rather than discrete drops — the count-based approach breaks down entirely
- Drop volume is not constant: Drop volume varies with rain rate; dynamic calibration needed (same issue as TBRGs)
- Capillary fouling: Detritus, mineral deposits, and biological growth in the capillary tube alter drop volume over time; requires regular cleaning and an "aging-in" period after construction
- Overestimation at low rates: According to Stagnaro et al. (2021), drop-counting gauges tend to overestimate at low-to-intermediate intensities
- Still funnel-dependent: All the standard funnel problems apply (wetting lag, wind undercatch, evaporative loss from the large inclined funnel surface)
Applications
Primarily used in research networks studying storm fine-structure, microwave signal attenuation, and orographic rainfall (e.g., 10-gauge Pluvimate network in Tahiti by Sichoix & Benoit). Commercial examples include the Pluvimate drop-counting gauge. Not currently deployed in any routine monitoring network at scale.
Drop-counting gauges are one of the few instruments that can resolve true rainfall intermittency at the sub-minute scale and capture brief intensity bursts (e.g., a 30-second spike to 450 mm/h within an otherwise 20 mm/h storm) that are completely invisible in TBRG records. Field data from Norfolk Island by Stow & Dirks showed intensity jumps from 50 to 454 mm/h captured at ~15 s resolution — events that would appear as three or four consecutive TBRG tips with no indication of their true character.
Disdrometers
Disdrometers measure the drop size distribution (DSD) of precipitation — the number and sizes of drops per unit volume of air. Some also measure fall speeds. From DSD + fall speed, rain rate can be estimated. They are the primary instrument for understanding precipitation microphysics and for calibrating Z-R relationships used in radar QPE.
Major Types
| Type | Sensing Method | Sensing Area | Fall Speed? | Key Limitation |
|---|---|---|---|---|
| Joss-Waldvogel (JWD) RD-80 | Electromechanical impact; Styrofoam cone + sensing coil | 50 cm² | No — uses Gunn & Kinzer empirical relationship | Tiny sensing area; misses rare large drops; no direct fall speed |
| OTT Parsivel2 (optical) | Laser light sheet; shadow dimensions | ~54 cm² | Yes — two beams, fall speed from time-of-flight | Oblique drop entry in wind causes overestimated diameter, underestimated speed |
| Thies Laser (optical) | Laser light sheet | ~46 cm² | Yes | Consistently underestimates rain rate by ~16.5% (Fehlmann et al. 2020) |
| 2DVD (2D video) | Two orthogonal light sheets; full drop shape imaging | ~100 cm² | Yes | Expensive; fragile; large data files |
| Micro Rain Radar (MRR) | K-band Doppler radar; vertical pointing | Volumetric (~m³) | Yes — Doppler | Large drops overrepresented; complex in mixed-phase |
Critical Wind Effects on Disdrometers
When wind causes raindrops to enter a Parsivel disdrometer obliquely, they take a longer path through the laser sheet — producing an apparently slower fall speed and apparently larger diameter. Friedrich et al. (2013) showed this leads to anomalously large, slow drops being reported. At wind speeds >14 m/s, Lin et al. (2021) found a 22.4% underestimation of 5-min rainfall vs. a co-located TBRG. Thurai et al. (2019) found gusts reduce fall speeds by 25–30%. This is directly relevant when using Parsivel data for Z-R calibration at windy sites — every Pennsylvania Ridge & Valley station qualifies.
The Small-Area Sampling Problem
One 5 mm drop delivers the same volume as ~126 drops of 1 mm diameter. At a sensing area of 50 cm² and a 1-minute tallying interval, large drops are severely undersampled — they arrive too infrequently relative to their volumetric importance. Tapiador et al. (2017) found an optical disdrometer could underestimate rainfall intensity by up to 70% due to undersampling of large drops. The Marshall-Palmer exponential DSD assumption does not rescue this problem if the actual DSD has a heavier large-drop tail.
Cross-Instrument Disagreement
Multiple studies have found significant disagreement even among identical, co-located instruments. Tokay et al. (2005) found significant divergence among six co-located JWDs. Tapiador et al. (2017) found disagreements among 14 co-located laser disdrometers. Chang et al. (2020) compared 2DVD, MRR, X-band radar, and JWD — and found the JWD to be the least accurate despite being the most widely used calibration reference.
| Study | Instrument | Bias vs. Reference | Notes |
|---|---|---|---|
| Feloni et al. | JWD vs. TBRG | −2% | Good aggregate agreement — but TBRG itself is biased |
| Islam (2012) | Parsivel vs. TBRG (hourly) | −30% | Cherrapunji, India — small drops dominate local DSD |
| Jaffrain & Berne (2011) | Parsivel vs. TBRG (15 months) | −4.3% | Good long-term; location-specific result |
| Fehlmann et al. (2020) | Thies vs. reference | −16.5% | Consistent systematic underestimate |
| Lin et al. (2021) | Parsivel in wind >14 m/s | −22.4% | Wind-induced oblique entry bias |
Acoustic, Optical, Thermal & Novel Methods
Acoustic Gauges
Acoustic methods detect the sound of raindrops striking a surface — a metal plate, water surface, or soil — and relate the acoustic signal to intensity. Key advantage: no collecting funnel. Drops fall directly onto the sensor. This eliminates wetting lags, evaporative losses from funnel surfaces, and wind undercatch entirely. The first drop is detected with zero delay.
| Acoustic Method | Sensor | Time Resolution | Key Advantage | Key Problem |
|---|---|---|---|---|
| Metal roof/plate microphone | Microphone under surface | 10 s (Dunkerley 2023); 44.1 kHz possible | No funnel; zero lag; instant start/end detection | Extraneous noise (wind, traffic, wildlife); large data files |
| Submerged marine hydrophone | Underwater microphone | Seconds | Widely proven at sea; Black et al., Ma & Nystuen | Ocean environment only |
| Acoustic disdrometer (water tank) | Submerged microphone in tank | Seconds | Winder & Paulson: estimates drop sizes AND intensity from bubble acoustics | Requires water-filled tank in field |
| Sodar (sound radar) | Acoustic transceiver | ~minutes | Bradley & Webb: samples ~20 m³ volume; works at low intensities | Complex system; outdoor deployment challenges |
| Sommer RHD sensor | 160 mm stainless hemisphere, no moving parts | First-drop detection | Commercial; compact; no moving parts | Not yet validated for long-term operational intensity monitoring |
Dunkerley's own field data logged at 10 s intervals showed a 1h46m event with 924 voltage readings — vs. 28 TBRG tip events. The acoustic record revealed rapid intensity fluctuations (including a spike to ~55 mm/h) entirely invisible in the TBRG record. This single comparison illustrates the fundamental mismatch between operational gauge records and the actual intensity structure of rainfall.
Optical / Camera / Video Methods
| Method | Approach | Resolution | Status |
|---|---|---|---|
| Long-path optical (Bradley et al. 2000) | Halogen lamps → CCD camera, 2022 m path; OD = 0.86·RR^0.667 (r²=0.99) | 1 min | Research only; difficult validation over km path |
| Security / traffic cameras | Drop streak length in image frames with known exposure; fall speed estimation | ~2 s potential | Emerging — millions of potential sites globally; complex processing |
| Video frame analysis (Dong et al.) | Mean of 1000 successive frames estimates rain rate | Seconds | Problems identifying streaks in intense rain |
| CNN on smartphone images (Yin et al. 2023) | 1/200 s exposure images; convolutional neural net | Not directly clock-syncable | Emerging; potentially huge number of sites |
Thermal Methods
| Device | Principle | Range | Key Issue |
|---|---|---|---|
| Battalino & Vonnegut (1978) thermal sensor | Heater power required to evaporate incident drops → W = 3.71·RR^1.06 (r²=0.96) | 0.3–350 mm/h | ~1 kW needed at 100 mm/h; wind on horizontal cylinder untested |
| Hotplate precipitation gauge (Rasmussen et al. 2011) | Power to maintain hotplate constant temp ∝ precipitation rate | 0.25–35 mm/h | Best for snowfall; 1-min data aggregated to 5 min; low max rate |
| Raynor rotary detector (1955) | Heated rotating cylinder with electrode array; signals single-drop arrival | Single-drop sensitivity | Found 69 min of rain before first TBRG tip; warm updraft prevents dew/drizzle detection |
Radiation (Nuclear) Methods
Changes in atmospheric gamma-ray flux during rainfall are measurable. Zelinskiy et al. (2021) found RRm = 0.97·RRo + 0.71 (r² = 0.93) comparing gamma-dose model vs. TBRG + disdrometer at 1-minute resolution. Bottardi et al. (2020) linked ²¹⁴Pb flux increases to rainfall statistically. Potential for long-term monitoring using existing dosimeter networks — though requires wider testing across geology and vegetation types.
Areal Methods: Radar, Microwave Links, Seismic
Point-based gauges measure precipitation at a single location. For catchment hydrology, flash flood prediction, and urban drainage design, spatially distributed intensity data is needed. Three areal approaches are reviewed by Dunkerley.
Radar-Based Methods
Radar detects backscattered energy Z from hydrometeors. The Z-R relationship links this to rain rate R. The approach is indirect — calibration requires ground truth from gauges, and the ground truth problem means this calibration is never fully clean.
| Radar Type | Coverage | Temporal Res. | Key Limitation |
|---|---|---|---|
| WSR-88D NEXRAD (S-band) | Hundreds of km radius | ~5 min | Beam overshoot (see Spatial Context section); beam blockage; anomalous propagation |
| X-band (portable, local) | ~50–80 km | ~1 min | Signal attenuation in heavy rain; useful for urban applications |
| Micro-Rain Radar (MRR) | Vertical column, ~3 km height | ~1 min | Vertically pointing only; K-band attenuation in heavy rain; Chang et al. found MRR most accurate of 4 types |
| Lufft WS100 (low-power) | ~point | Seconds | Vokoun & Moravec (2022): reported rainfall far larger than conventional gauges in mountain tests |
Dai et al. (2019) showed that between the radar scan altitude and the ground, drops drift laterally in wind. A 5 mm drop can drift several kilometres; a 0.2 mm drop can drift up to 14 km. Drop diameter also declines during fall due to evaporation, reducing the intensity below what the radar scan suggested. This means even a perfectly accurate radar scan at beam height does not represent surface precipitation at the point directly below — a problem that grows with beam height and wind speed.
Microwave Attenuation (Cellular Links — CML)
Rain attenuates microwave signals along tower-to-tower cellular links. The greater the path-integrated attenuation, the higher the rainfall rate along the link. Global CML networks already exist in most countries — the infrastructure is in place. Urban areas are most densely covered.
| Advantage | Limitation |
|---|---|
| Vast existing infrastructure — no new gauges needed | Wet antennas cause attenuation independent of rain — overestimates when rain film forms on antenna |
| High spatial density especially in urban areas | Path-averaged, not point — cannot be directly compared with gauge data |
| Can detect rainfall onset / cessation | Rain may only cover part of the path — how much is unknown |
| Continuous real-time availability | Less effective at quantifying exact rainfall amounts vs. detecting occurrence |
Seismic Methods
Raindrop impact on soil creates seismic energy detectable by geophones. Bakker et al. (2022) showed that 90% of seismic power arises from drops >3 mm — making seismic monitoring best suited to intense rainfall with large drops. Diaz et al. (2023) used a high-density seismic network to track storm cell passage across Spain, collecting data at 6-min intervals. Like CML, seismic methods can provide spatial coverage — but validation against ground truth remains limited.
Miscellaneous / Crowdsourced
| Method | Signal Used | Key Finding |
|---|---|---|
| Windscreen wipers (Rabiei et al. 2013) | Wiper speed (manual or automatic sensor) | Relationship established in lab simulation; 1% of South Korea's 20M vehicles would yield enormous dataset even with wide uncertainty |
| Smartphone microphone (Gaucherel & Grimaldi) | Acoustic signal from rain on phone | "Pluviophone" — promising but requires careful acoustic design |
| Smart umbrella + phone (Guo et al. "Chaac") | 1 s and 10 s audio clips from umbrella attachment | Demonstrates concept; field validation limited |
| Citizen rain gauges (Mapiam et al. 2022) | Manual gauges read by citizens via app | Improved hourly radar bias correction in Thailand using two-step Kalman filter |
| Dashcam wiper activity (Bartos et al. 2019) | Wiper on/off from vehicle camera vision | High-accuracy binary rainfall maps (rain/no rain) from connected vehicles in US cities |
Comprehensive Method Comparison Matrix
Synthesized from Dunkerley (2023). Ratings reflect capability for rainfall intensity measurement specifically, not just accumulation. All methods perform better for accumulation than for intensity.
| Method | Time Res. | Wind Undercatch? | Low Intensity | High Intensity | Start/End Detection | Spatial Coverage | Ops. Cost |
|---|---|---|---|---|---|---|---|
| TBRG (standard) | ~1 min unreliable | Yes — significant | Poor (misses between tips) | Undercatch >10% at 100 mm/h | Poor (± minutes to hours) | Point only | Low |
| Weighing gauge (Geonor/Pluvio2) | 1 min (≥7 mm/h) | Yes (solid, correctable with SPICE) | Floor ~7 mm/h for 1-min data | Good to 1200 mm/h (Lambrecht) | Moderate (weight threshold) | Point only | Moderate |
| Drop-counting gauge | 6–15 s | Yes (has funnel) | Detects 0.1 mm/h with <2 min lag | Trickle onset >50–100 mm/h | Good (~minutes) | Point only | High (maintenance) |
| Disdrometer (Parsivel/JWD) | 1 min | Oblique entry in wind — 22% bias | Misses small drops | Misses rare large drops | Moderate | Point only | Moderate–High |
| Acoustic gauge | 1 s or finer | None (no funnel) | First-drop detection | Linear response across wide range | Excellent — first/last drop | Point only | Moderate (noise filtering) |
| Hotplate gauge | 1–5 min | Reduced (no funnel, but geometry untested) | 0.25 mm/h lower limit | 35 mm/h upper limit | Good | Point only | High (power consumption) |
| NEXRAD radar | ~5 min | N/A — not a surface gauge | Misses very light rain; clutter | Z-R saturation in heavy convection | Moderate | Hundreds of km² | Infrastructure exists |
| Micro-rain radar (MRR) | 1 min | N/A | K-band sensitivity threshold | Attenuation in heavy rain | Good | Vertical column only | Moderate |
| Cellular microwave links (CML) | Minutes | N/A | Wet antenna noise obscures light rain | Good for heavy rain detection | Moderate | Urban network coverage | Essentially free (data already collected) |
| Lysimeter | 10 min | None (ground level, natural trajectory) | Can detect dew; high sensitivity | Surface runoff risk in heavy rain | Moderate (weight threshold) | Point (large footprint) | Very high |
| Smartphone crowdsource | Seconds–minutes | N/A | Detection threshold uncertain | Saturates acoustic signal | Moderate | Potentially millions of points | Near-zero |
Dunkerley's Conclusions
No optimum method has been found. The TBRG remains dominant despite its known deficiencies because of cost, simplicity, and the value of the continuous historical record it has accumulated globally. The most promising paths forward, per Dunkerley, are: (1) acoustic methods for true high-resolution point intensity; (2) cellular microwave links for distributed urban coverage; (3) crowdsourced visual/acoustic data from smartphones and vehicles as a supplement to sparse gauge networks. None of these yet provides a validated replacement for the TBRG in the routine collection of long-term climatological data.
Relevance to Pennsylvania Precipitation Monitoring
| Context | Current Instrument | Key Gap | Recommended Supplement |
|---|---|---|---|
| ASOS (MDT, CXY, etc.) | Heated TBRG (FP-5 or equivalent) | Sub-1-min intensity; light rain miss; liquid undercatch | Disdrometer co-location; dynamic calibration |
| CoCoRaHS / COOP | Manual daily accumulation gauge | No intensity at all; next-day reading introduces timing errors | Cheap TBRG dataloggers; citizen acoustic sensors |
| WeeWX personal station (GW3000B / WS90) | Piezo-electric rain sensor | Intensity response non-linear; no TBRG-equivalent calibration | Ecowitt WH40 bucket gauge as primary; piezo as onset indicator |
| PAPRISM500 historical reconstruction | PRISM (based on COOP/ASOS) | All historical data is accumulation-based; no intensity in record | ERA5 precip rate fields for modern period; daily → hourly disaggregation with storm-type conditioning |
| Hershey Crowd IQ / M5StickC WiFi probe | Open-Meteo precip rate | ERA5 precip rate is ~hourly, ~27 km — cannot resolve convective bursts | Nearest ASOS 1-min data for intensity-triggered crowd behavior analysis |
Dunn et al. (2025): TBRG Undercatch — Error Framework
Dunn, R.E., Fowler, H.J., Green, A.C., Lewis, E. (2025). Tipping-bucket rain gauges: a review of the undercatch phenomenon, and methods for its reduction and correction. Weather, 80(6), 196–205. doi:10.1002/wea.7736 · Open Access CC BY · Newcastle University / Univ. of Manchester
Published June 2025 in Weather (Royal Meteorological Society), this is a tightly focused solution-oriented review — not just cataloguing problems but specifically tracing the evolution of design improvements and correction factor (CF) methodologies for TBRGs. The authors are explicit that previous reviews were problem-based rather than solution-oriented. Their goal: a framework useful for practitioners applying bias corrections today, culminating in three concrete recommendations for the field.
Two-Category Error Framework
All TBRG errors fall into two top-level categories (Ciach 2003; Villarini & Krajewski 2008), which subdivide further:
| Top Category | Subcategory | Nature | Typical Magnitude | Primary Mitigation |
|---|---|---|---|---|
| Spatial Sampling Errors | Network density / representativeness | Point measurement applied to area | Highly variable — 250m² catchment data covering km² area | Denser networks; radar/satellite fusion |
| Siting bias (exposure, rain shadows, regional inconsistency) | Systematic | Compounding — see Figure 3 of paper | Strategic siting; metadata documentation | |
| Local Errors — Catching | Aerodynamic undercatch (wind-induced) | Systematic | 5–46% liquid; up to 67%+ solid/frozen | Aerodynamic design; wind shields; lower mounting |
| Evaporation, sublimation, out-splash | Systematic | Evap: 4–11% (warm); Sublim: 0.5–0.75mm/day (cold); Splash: 1–2% | Funnel design; deep collector walls | |
| Local Errors — Counting | Mechanical / high-rate undercatch | Systematic | 2–30%; >30–50mm/h: rapid degradation | Dynamic calibration; gauge resolution increase |
| Random (human error, equipment failure) | Random | Unpredictable | Quality control; co-located gauges |
The Catching vs. Counting Distinction
This is the key conceptual split in the paper. Catching errors = the gap between rain falling from the sky and rain entering the gauge collector. Counting errors = the gap between rain entering the collector and rain being correctly measured. Wind undercatch is a catching error. High-rate mechanical undercatch (rain during tip) is a counting error. They compound — and their combined effect is what Adam & Lettenmaier's correction framework must address.
Mechanical counting errors are 2–30%. In stormy conditions, wind-induced catching errors alone can reach 75% underestimation (Neff 1977). The TBRG is a well-engineered counter of water volume delivered to it — it is simply that during the most hydrologically important events (storms, heavy rain, snow), it fails catastrophically at catching that water in the first place. This is why the Adam & Lettenmaier framework, which is fundamentally about catching errors, captures the dominant bias.
The Undercatch Phenomenon — Detailed Physics
Aerodynamic Undercatch: The Primary Mechanism
When wind flows around a TBRG, the gauge body creates a bluff-body disturbance — a zone of decelerated airflow above and around the orifice. This is unavoidable physics (Constantinescu et al. 2007). The deformed wind field deflects raindrop trajectories away from the collector (Cauteruccio et al. 2020). Three methods are used to quantify this:
| Method | What It Captures | Strengths | Limitations |
|---|---|---|---|
| Wind tunnel experiments | Flow deformation around specific gauge geometries | Controlled; isolates design variables; particle image velocimetry available | Artificial wind; cannot fully replicate natural turbulence or DSD |
| Computational fluid dynamics (CFD) | Full 3D flow field; embedded particle tracking for drop trajectories | No physical experiment needed; any geometry; any wind/DSD scenario | Requires validation; cannot include all real-world complexity |
| Field intercomparison (pit gauge reference) | Real-world integrated undercatch under all conditions | Ground truth closest to operational reality; no artificial conditions | Cannot isolate individual error sources; pit gauge itself has small errors |
Wind Speed Dependency
The magnitude of wind-induced undercatch scales directly with wind speed. Bratzev (1963), Larson (1971), and Larson & Peck (1974) all found approximately 1–2.2% additional undercatch per mph increase in wind speed for an unshielded gauge. At typical UK/US inland storm wind speeds (15–25 mph = 6.7–11.2 m/s), this implies 15–55% wind-induced catching error from wind alone — before any mechanical errors.
Additional Factors Modifying Wind Undercatch
| Factor | Direction of Effect | Physical Reason | Notes |
|---|---|---|---|
| Wind turbulence (vs. uniform flow) | Generally increases undercatch | Turbulent fluctuations create asymmetric trajectory deviations not captured in mean-wind equations | Cauteruccio 2020 PhD thesis shows turbulence specifically amplifies updraft above collector |
| Drop size distribution (DSD) | Small drops = more undercatch | Small drops have lower terminal velocity; their trajectories bend more in cross-wind; larger drops have more inertia | Nešpor & Sevruk (1999) — this is why DSD matters to CF development |
| Precipitation form (solid vs. liquid) | Solid vastly worse | Snow crystals have extremely low fall velocity and are easily deflected; can produce <33% catch in exposed sites | Benning & Yang (2005): recorded precipitation can be less than one-third of actual |
| Gauge height | Higher = more undercatch | Wind speed increases with height per log profile; higher orifice = more aerodynamic disturbance | Pollock et al. (2018); UK now installs TBRGs at ground level specifically to address this |
| Gauge design (shape) | Aerodynamic shapes reduce undercatch | Calix/champagne-glass profile minimizes flow separation above orifice | Colli et al. (2018) CFD; Cauteruccio et al. (2024) — current state of the art |
| Orifice rim geometry | Rim thickness and profile matter | Rim creates turbulence wake that affects the final drop trajectory as it crosses the orifice | Cauteruccio et al. (2021) — even subtle rim changes measurably affect catch efficiency |
Evaporation, Sublimation, and Splash Losses
| Error Type | Mechanism | Magnitude | Climate Dependency |
|---|---|---|---|
| Evaporation | Precipitation collected in funnel vaporises before draining to buckets; inclined funnel surface area ~1346 cm² vs. ~700 cm² orifice — nearly double the evaporation-prone surface | 4–11% in warm regions | Negligible in cool regions (Yang & Ohata 2001); significant in arid/warm climates |
| Sublimation | Solid precipitation collected in funnel sublimes directly to vapour | 0.5–0.75 mm/day where snowfall occurs (Fassnacht 2004) | Irrelevant in warm climates; significant for daily snow accumulation in cold climates |
| Out-splash | Droplet fragments on impact with collector rim/walls; not all fragments are captured | ~1–2% | Most modern deep-funnel gauges render this negligible (Legates 1992) |
In temperate climates like Pennsylvania or the UK, undercatch exhibits strong seasonality: it peaks in winter (high wind + frozen precipitation + sublimation), is moderate in autumn/spring (transitional — mixed precipitation, variable wind), and is lowest in summer (warm, less wind, all-liquid precipitation). Zhao et al. (2024) documented this pattern specifically for China. This seasonality directly affects the seasonal breakdown in Adam & Lettenmaier's results — DJF corrections are 2× JJA corrections globally (+15.2% vs. +7.3%).
Undercatch Reduction: Design & Installation
Evolution of TBRG Shape
The history of TBRG design is a history of progressively reducing aerodynamic disturbance. Four stages (Figure 7 of paper):
| Design Stage | Shape | Undercatch Improvement | Still in Use? |
|---|---|---|---|
| Early TBRGs | Plain cylinder | Baseline — worst aerodynamics; flat-top rim creates large wake | Legacy networks |
| Cylindrical with funnel top | Cylinder + cone | Marginal improvement from raised funnel | Common |
| Aerodynamic (plastic) — Institute of Hydrology design | Tapered body, reduced frontal area | Significant reduction in flow separation (Colli et al. 2018) | Widely deployed |
| Calix / champagne-glass shape (aluminium) | Inverted flared profile | Currently optimal per Folland (1988) and Pollock et al. (2018) | Best practice recommendation |
Detailed Design Parameters That Affect Undercatch
Beyond overall shape, the following gauge-level parameters have been quantified to affect catching efficiency:
- Orifice rim thickness and profile — Cauteruccio et al. (2021): rim geometry alters the turbulent wake over the collector opening. Sharper, thinner rims are preferable.
- Depth of collector vertical wall — deeper walls reduce splash losses and help retain drops that enter obliquely
- Slope of the funnel — steeper funnel angles speed drainage, reducing retention losses and funnel-surface evaporation
- Funnel coating — hydrophobic coatings reduce adhesion and wetting losses (WMO 1996; Mekis & Vincent 2019; Padrón et al. 2020)
- Funnel size in arid regions — larger funnel improves detection of <1mm events; however, Al-Wagdany (2015) found it introduces underestimation bias for events >10mm at high intensities. Trade-off must be evaluated regionally.
Installation Height: The UK Solution
TBRGs in the UK are now installed at ground level — orifice flush with the surrounding surface — following Pollock et al. (2018). This eliminates the log-profile wind speed amplification that affects elevated gauges. At ground level, wind speed is near zero, virtually eliminating aerodynamic undercatch. However, this practice increases the risk of debris blown into the orifice and in-splash from rain striking the ground near the gauge. It is not universally replicated — the US NWS 8" gauge is typically at 1.1m, and this discrepancy is exactly why the Adam & Lettenmaier wind profile scaling (Eq. 3-4) matters for US records.
Wind Shields
| Shield Type | Description | Undercatch Reduction | Practical Notes |
|---|---|---|---|
| Turf wall | Low wall of cut turf around gauge, height matching gauge orifice | Very effective — approximates pit gauge conditions | Requires regular maintenance; roots can disturb gauge base |
| Alter shield | Hanging vertically-oriented slats in a ring around gauge | Standard operational shield; also available as double-Alter | Most widely deployed operational shield; double-Alter better for snow (WMO-SPICE) |
| OTT screen style | Louvered cylindrical screen | Moderate — depends on louvre geometry | Integral with OTT Pluvio2 design; cleaner aesthetically |
| Tretyakov shield | Concentric rings of metal leaves | High performance for solid precip | Russian standard; used in WMO 1998 intercomparison as primary shield reference |
| DFIR (WMO reference) | Double-fence with inner Alter; 12m outer fence diameter | Reference standard — near-perfect for snow | Research use only; not operational |
Siting Considerations
Modern guidance emphasises careful siting alongside good gauge design. Key principles from Dunn et al. and the WMO:
- Highly exposed sites (open fields, ridge lines, airports) experience the worst wind undercatch — the aerodynamic equations apply maximally here
- Obstructions within ~4× their height should be avoided to prevent rain shadows (local precipitation deficit downwind)
- Terrain slope affects rainfall angle and can systematically bias catch in one wind direction
- Detailed metadata on gauge type, height, shielding, and siting must be recorded — this is the critical missing input for station-level CF development
- Network heterogeneity (mixing of gauge types and heights across a national network) is essentially unavoidable in multi-decadal records; CFs must account for this
Correction Factors — Table 1 Complete Reference
Dunn et al.'s Table 1 is the most comprehensive recent summary of TBRG catching error correction methodologies. All 8 methods reproduced below with full detail from the paper.
Every correction factor requires a "ground truth" reference. The choice of reference shapes the resulting CF fundamentally. Early CFs used ground-level gauges. Modern CFs use pit gauges (most common), shielded gauges (Kochendorfer et al. 2017 / SPICE), laboratory experiments (Nešpor & Sevruk 1999), or CFD numerical simulation (Cauteruccio et al. 2024). None of these is perfect — pit gauges still slightly underestimate; shielded gauges have their own biases; lab conditions don't replicate real turbulence; CFD cannot capture all atmospheric complexity. The choice of ground truth is the first and most consequential decision in CF development.
Required inputs: Air temperature, exposure level, wind speed at 10m AGL, precipitation intensity
Considers: Solid and liquid precipitation (binary); aerodynamic effects; wetting loss
Gauge scope: Hellmann gauge only
Key strength: Simple tabulated application; three exposure levels
Key weakness: Assumes ground-level exposure errors are negligible; wind speed often unavailable at TBRG sites; Hellmann-only
Required inputs: Location, date only
Considers: Wind effects, wetting losses, gauge evaporation
Gauge scope: Not gauge-specific — universal
Key strength: Global coverage; extremely easy to apply; foundation of Adam & Lettenmaier (2003)
Key weaknesses: Based on 1920–1980 data; arid/mountainous/polar underrepresented; ½° grid misses topographic variability; rough meteorological parameter estimates may cause over-correction; Ehsani & Behrangi (2022) found ~4% difference vs. GPCC CF-F method
Required inputs: Precipitation intensity, wind speed at gauge level, precipitation amount, gauge type, air temperature at 2m
Considers: DSD (assumed from intensity), crystal structure, site exposure
Gauge scope: 4 specific gauges with specific windshield setups
Key strength: DSD and crystal structure incorporated; exposure in correction formula
Key weakness: Tabulated parameters only valid for the 4 specific gauges/shields; evaporation/wetting provided as fixed monthly values in Nordic context
Required inputs: DSD, wind speed, precipitation intensity
Considers: DSD via rain type (orographic, thunderstorm, showers); multiple timescales
Gauge scope: Specific class of gauges (Mk2 type and similar geometry)
Key strength: Rain type as DSD parameter; applicable at multiple timescales; reduces over-correction risk
Key weakness: Only wind-induced errors; Mk2 gauge class only
Required inputs: Precipitation intensity (remote), snow fraction (remote), wind speed (remote ≤50km), temperature (remote)
Key strength: No on-site wind measurement required; remote stations can fill data gaps
Key weakness: Hellmann gauge only; Danish context; only tested on 12 gauges; single exposure level assumed; remote data still must be within 50–100km
Required inputs: Air temperature, dew point temperature, relative humidity, wind speed at gauge rim, precipitation intensity
Considers: Inconsistent precipitation form within single time increment via GPCC phase scheme
Key strength: Mixed precipitation handled correctly; currently used operationally in GPCC Monitoring product
Key weaknesses: Inherits Førland (1996) limitations; in tropical regions, calculated intensities may be too low → over-correction; wind speed at gauge rim is an unusual measurement; data-intensive
Required inputs: Air temperature, precipitation intensity, wind speed at 10m AGL
Considers: Trace precipitation, wetting losses, wind-induced errors, mixed precipitation
Key strength: Trace precipitation explicitly included; mixed precip handled via temperature-based partitioning
Key weaknesses: Does not incorporate gauge design or exposure; conservative assumptions for trace/wetting; no evaporation correction
Required inputs: Wind speed, precipitation intensity
Considers: Wind-induced losses, wetting losses, evaporation losses, trace amounts — all used in CF development though only wind speed needed operationally
Key strength: Minimal data requirements; wind speed alone sufficient to apply
Key weaknesses: Single study site only; specific gauge types and set-ups only; liquid precipitation only; no terrain exposure differentiation
Required inputs: Precipitation intensity, DSD, precipitation form, wind speed
Considers: Full gauge geometry effects including orifice rim; both liquid and solid particle dynamics
Key strength: Highly detailed physically-based investigation; explicit gauge design impact; captures nonlinear DSD-wind interactions
Key weaknesses: Complex to apply; developed in controlled laboratory conditions — not designed for direct operational application; only wind-induced undercatch; DSD and rim wind speed not routinely available at operational sites
Cross-CF Comparison: What Each Method Does and Doesn't Cover
| CF Method | Wind | Wetting | Evap. | Trace | Gauge-Specific | Operational? |
|---|---|---|---|---|---|---|
| Allerup & Madsen 1980 | ✓ | ✓ | ✗ | ✗ | Hellmann only | Yes |
| Legates & Willmott 1990 (CF-L) | ✓ | ✓ | ✓ | ✗ | Universal | Yes (GPCP) |
| Førland et al. 1996 | ✓ | ✓ | ✓ | ✗ | 4 gauges | Yes |
| Habib et al. 1999 | ✓ | ✗ | ✗ | ✗ | Mk2 class | Research |
| Fuchs et al. 2001 (CF-F) | ✓ | ✓ | ✓ | ✗ | Inherited 4 | Yes (GPCC) |
| Ye et al. 2004 | ✓ | ✓ | ✗ | ✓ | China gauges | Yes |
| Mekonnen et al. 2015 | ✓ | ✓ | ✓ | ✓ | Specific site | Limited |
| Cauteruccio et al. 2024 | ✓ | ✗ | ✗ | ✗ | Lab-specific | No |
| Adam & Lettenmaier 2003 | ✓ | ✓ | ✗ (excluded) | ✗ | 30 gauge types | Yes — gridded |
Over-Correction Risk
Universal CFs applied at coarse temporal scales can over-correct by up to 200%. The risk is highest when: (1) a globally-gridded CF is applied to a specific site with very different exposure than the surrounding grid cell average; (2) monthly CFs are applied to events that happened mostly in one extreme day within that month; (3) the CF was derived under conditions not representative of the target site's exposure. Dunn et al. explicitly warn that simple universal CFs "should be used with caution." A categorical approach — different CFs for liquid/mixed/solid, different exposure classes — is a recommended middle ground.
Future Directions: AI, Machine Learning & Emerging Technologies
Why This Matters Now (2025 Context)
Dunn et al. note that TBRGs have become more precise over time, requiring less correction. However, heterogeneous multi-decadal datasets — combining old cylindrical gauges, modern aerodynamic gauges, ground-level UK installations, elevated US NWS 8" gauges, and variable shielding — still require effective CFs for research continuity. The CF problem has not been solved; it has evolved.
Ehsani & Behrangi (2022) showed that existing CFs result in significantly different rainfall estimations — with the two major global products (GPCP using CF-L; GPCC using CF-F) differing by ~4% in annual global precipitation. Most global hydrological simulation datasets still do not account for rain gauge catch deficiencies at all (Adam & Lettenmaier 2003 is still cited as justification for not correcting).
Machine Learning and Big Data Approaches
| Approach | What It Enables | Current Status | Key Reference |
|---|---|---|---|
| Spaceborne sensor validation | GRACE satellite gravity anomalies → snowfall accumulation estimates → independent validation of high-latitude gauge undercatch corrections without ground truth | Research — Behrangi et al. 2018 applied to Arctic | Behrangi et al. (2018) |
| ML merging of heterogeneous sources | Combines TBRG, radar, satellite, CML, and other data to remove biases from TBRG datasets that traditional correction factors cannot address | Research — Guarascio et al. (2020) | Guarascio et al. (2020) |
| Neural network CNN on rainfall images | Smartphone/camera images → rainfall intensity without a gauge; potential for vast ground-truth dataset creation | Early research — Yin et al. (2023) | Yin et al. (2023) |
| Real-time gauge-mounted undercatch sensors | Small sensors on TBRG measure local wind speed, drop size, turbulence → real-time undercatch quantification and correction | Emerging — Dutton & Balsamo (2024) | Dutton & Balsamo (2024) |
| Rainfall simulator for CF development | Known controlled rainfall applied to gauge in field — eliminates ground truth ambiguity entirely; true catch efficiency measurable directly | Proposed — not yet demonstrated for undercatch research specifically | Dunn et al. (2025) recommendation |
Dunn et al.'s Three Formal Recommendations
- All future studies incorporating rain gauge data should acknowledge undercatch — either by explicitly recognising the uncertainty it introduces, or by applying an appropriate CF. Silence on undercatch is no longer acceptable given the breadth of evidence.
- Further research using rainfall simulators — to ensure the 'ground truth' is known and controlled. Existing pit gauges and shielded gauges still underestimate; a truly known-input experiment is the next frontier.
- Development of a UK-specific CF — versatile enough to be broadly adopted (possibly a categorical approach for liquid/mixed/solid × exposure class) but not so universal it leads to misrepresentation and over-correction. Explicitly: the Legates & Willmott global CF is insufficient for UK precision needs.
Connection to the PAPRISM500 and Pennsylvania Work
| Dunn et al. Issue | Pennsylvania Analog | Practical Implication |
|---|---|---|
| No single CF widely adopted | Adam & Lettenmaier (2003) still primary reference despite 22-year age | For PAPRISM500, A&L corrections are the best available but acknowledge their 1979–1998 basis |
| Heterogeneous network — mixing gauge types/heights | PA COOP network: mix of NWS 8" (1.1m), FP-5 ASOS (heated, ~1m), CoCoRaHS manual (0.3m) | Each gauge type needs its own CF; mixing without correction introduces systematic spatial bias in PRISM training data |
| Metadata absence prevents CF application | Historical PA COOP records: limited shield metadata; unknown exposure class | Use ERA5 wind climatology as surrogate; apply exposure class assumption based on land cover / station type |
| Seasonality of undercatch in temperate climates | PA DJF corrections 2× JJA corrections | Seasonal weighting essential for accurate PAPRISM500 monthly fields — do not apply a single annual CF |
| Over-correction risk from universal CFs | Applying A&L gridded CRs directly to valley stations in Ridge & Valley | Valley stations have lower effective wind than the surrounding ½° cell average implies; consider downscaling CF by exposure class |
| ML emerging for bias removal | PAPRISM500 GROS framework (Claude + ChatGPT + Gemini) | Future pipeline: train a random forest on known-exposure ASOS stations to predict exposure-corrected CRs for data-sparse COOP sites |
Supporting Papers — Complete Reference Library (16 Papers)
All 16 papers were read in full. Organized by the lineage they form — each paper built on those before it, creating a chain from 1999 first-principles CFD through 2025 operational corrections.
| # | Citation | Core Contribution | Connects To |
|---|---|---|---|
| 1 | Nešpor & Sevruk (1999) | First 3D numerical simulation of airflow + particle tracking for gauge undercatch. Gamma-type wind error function. | Origin of all CFD work in this library |
| 2 | Ciach (2003) | 15 co-located TBRGs — quantifies local random errors. Error standard deviation as function of intensity and timescale. | Error categorisation framework in Dunn 2025 |
| 3 | Sieck et al. (2007) | Goodwin Creek watershed — data quality control, out-of-level orifices, wind effects. Finds DSD-based correction no better than simple rate-based. | Practical limits of correction methods |
| 4 | Duchon & Biddle (2010) | 50 mm/h mechanical undercatch threshold. 5 m/s wind onset for elevated TBRG. Geonor pit as reference. | Key thresholds cited throughout all TBRG literature |
| 5 | Colli et al. (2016) Part I | RANS + LES CFD of single-Alter shielded Geonor T-200B. LES reveals turbulence underestimated by RANS above orifice rim. | Foundation for Colli 2018 and Cauteruccio chain |
| 6 | Colli et al. (2016) Part II | Lagrangian particle tracking of wet and dry snowflakes. Alter shield validated. LES CE lower than RANS. | First LES-based collection efficiency for snow |
| 7 | Sieck / Pollock (2018) | Field proof: calix/aerodynamic shape + ground-level install. Upland 23%+ undercatch, aerodynamic gauge 11.2%. CE vs wind r²=0.81. | Best-practice recommendation still current in 2025 |
| 8 | Colli et al. (2018) | CFD of 4 gauge geometries. Inverted conical best. Conventional cylinder worst. Recirculating flow above orifice improves CE. | Mechanistic proof for shape recommendations in Pollock |
| 9 | Angulo-Martínez et al. (2018) | Parsivel2 vs Thies LPM — 2 years, 200 events, peaks at 277 mm/h. Significant differences in PSVD, rain rate, radar reflectivity. | Disdrometer cross-instrument disagreement quantified |
| 10 | Kochendorfer et al. (2020) | Heated TB gauges at 5 WMO-SPICE sites. New multigauge transfer function optimized for long-term accumulation accuracy. | SPICE extension for ASOS-type heated TB gauges |
| 11 | Cauteruccio et al. (2021) | Wind tunnel with real water drops. High-speed camera captures drop trajectories. Validates LPT model. First direct visualization of wind-induced undercatch. | Validates all Cauteruccio CFD results experimentally |
| 12 | Johannsen et al. (2020) | 3 disdrometer types (PWS100, Thies LPM, Parsivel) + Pluvio2 in Austria. All underestimate vs. weighing gauge. KE correction factors 1.15–1.36. | Disdrometer performance under natural rainfall |
| 13 | Segovia-Cardozo et al. (2023) | Comprehensive TBRG review from hydrology perspective. Management as third error category beyond instrumental and environmental. | Problem-side complement to Dunn 2025 |
| 14 | Cauteruccio et al. (2024) | 6 commercial gauge CFD comparison — liquid and solid. Inverted conical and Nipher-shielded rank highest. All rank low for solid at light PI. | Most rigorous current CE dataset for gauge selection |
| 15 | Angeloni et al. (2024) | Thies 3D Stereo disdrometer evaluation vs. LPM. Good rain/snow classification agreement. 3DS detects more small particles. Both underestimate terminal velocity for drops >3mm. | State of imaging disdrometer technology 2024 |
| 16 | Cai et al. (2025) | CFD simulation + field experiment in China. 4.3m vs 0.7m vs 0m — 0.7m gauge catches ~95% of ground-level. 19% wind velocity increase in 5cm zone above collector. New correction formula based on routine meteorological elements. | Most recent (2025) CFD confirmation of height effect |
The Research Lineage
These papers form a clear chain. Nešpor & Sevruk (1999) introduced 3D numerical simulation — all subsequent CFD work cites it. Colli 2016 Parts I & II refined it with LES turbulence modeling for the Geonor-Alter configuration, specifically for snow. Cauteruccio 2021 validated the approach experimentally with actual water drops in a wind tunnel. Colli 2018 extended it to four different gauge shapes. Cauteruccio 2024 completed the picture by comparing six commercial gauges under both liquid and solid conditions. Meanwhile, Pollock 2018 took the field-measurement path — real gauges at real sites — and proved the design recommendations from CFD actually hold in practice. Sieck 2007, Duchon 2010, and Ciach 2003 form the empirical reality-check layer showing how complicated real-world measurement actually is and where clever correction methods hit their limits.
Pollock et al. (2018) — Quantifying and Mitigating Wind-Induced Undercatch
Water Resources Research, 54. doi:10.1029/2017WR022421 · Open Access · Newcastle University
The definitive modern field study on gauge shape and mounting height effects. Two sites — an exposed Scottish upland and an English lowland — with pit gauge reference, conventional cylinder gauges, and aerodynamic (calix-shaped) gauges at multiple heights. This paper is the single most cited recommendation for both ground-level installation and calix-shape design in current operational guidance.
Key Quantitative Results
The Height Effect — Core Finding
The same aerodynamic gauge at 1.5m undercatches 17.5%; at 0.5m it undercatches 11.2%. That 6.3 percentage-point difference from halving the height demonstrates conclusively that the vertical wind gradient near the ground matters as much as gauge shape. The paper quantifies this with logarithmic wind profile theory and confirms it field-experimentally.
The conventional cylinder gauge at 0.5m showed >23% undercatch while the aerodynamic gauge at the same height showed 11.2%. This 12-point improvement from shape alone — without changing height — establishes that gauge body geometry independently controls a large fraction of undercatch. Combining aerodynamic shape with lowest practical mounting height gives the largest achievable improvement short of pit installation.
Practical Recommendations from the Paper
- Mount gauges as close to the ground as practical — 0.5m or lower wherever debris/flooding risk allows
- Adopt calix/champagne-glass aerodynamic shapes as the new standard (rather than waiting for a gauge to fail and replacing with identical design)
- Upland sites require specific quantification — a single national undercatch factor is inadequate given the difference between upland (11.2% even with aerodynamic gauge) and lowland (3.4%) sites
- Drop-counting gauges cited as useful for finer time-resolution data to better characterise CE-wind relationships at sub-hourly scales
Connection to US NWS 8" Gauge
The NWS 8-inch gauge sits at 1.1m with a conventional cylindrical shape — exactly the worst-case configuration this paper tests. The upland 23%+ undercatch figure almost certainly applies to any US ASOS station in an exposed setting during rain storms. The Adam & Lettenmaier framework corrects for this with the log wind profile scaling, but the starting point — 1.1m conventional cylinder — is acknowledged even in the 2018 field literature as the configuration with the most room for improvement.
Cauteruccio et al. (2024) — Overall Collection Efficiency of 6 Gauge Types
Water Resources Research, 60, e2023WR035098. doi:10.1029/2023WR035098 · Open Access · University of Genova / WMO Lead Centre
The most comprehensive CFD collection efficiency study to date. Six commercial precipitation gauges with different outer geometries are compared under a full matrix of wind speeds and precipitation intensities (PI) for both liquid and solid precipitation. This is the paper that operationalizes the CE concept — moving from individual gauge studies to a direct comparison framework allowing gauge selection based on local precipitation climatology.
Six Gauges Studied
| Gauge | Shape Category | Liquid Performance | Solid Performance | Key Feature |
|---|---|---|---|---|
| Nipher-shielded | Shield + cylinder | High | Best for solid | Nipher shield reduces updraft velocity above orifice |
| Inverted conical (calix type) | Aerodynamic | High | Moderate | Recirculating flow structures above orifice improve catch |
| Second inverted conical | Aerodynamic variant | Good | Moderate | Geometry variant — shows sensitivity to subtle shape differences |
| Quasi-cylindrical A | Near-cylinder | Intermediate | Low | Modern operational form — slight taper reduces worst-case |
| Quasi-cylindrical B | Near-cylinder | Intermediate | Low | Different orifice rim treatment from quasi-cyl A |
| Chimney-shaped | Traditional extended cylinder | Lowest | Lowest | Tall narrow body maximises bluff-body flow separation above orifice |
Critical Finding: All Gauges Fail for Light Solid Precipitation
For solid precipitation at light to moderate precipitation intensity, all gauges except the Nipher-shielded rank low on collection efficiency. At light snowfall — which is precisely when snowfall records are most climatologically important in marginal snowpack environments like Pennsylvania — even the best-designed aerodynamic gauge has substantially degraded performance. This is the physics behind the 50%+ undercatch figures at low wind speeds that are reported for snowfall in exposed US ASOS sites.
CE Dependence on Precipitation Intensity — Why This Matters
The paper establishes a functional dependence of CE on PI that is non-monotonic and geometry-dependent. For liquid precipitation, CE generally improves at higher rainfall rates because larger drops have more inertia and resist aerodynamic deflection. For solid precipitation, the relationship is more complex because snowflake density and fall velocity both depend on crystal type and temperature. The practical implication: a single CE value per gauge type at a given wind speed is an approximation — the true CE varies continuously with precipitation intensity and must be integrated over the local PI distribution to get the climatologically correct correction.
Validation Chain
Results were validated against the Cauteruccio 2021 wind tunnel experiments (also in this library). The LPT model drag coefficient formulation was validated against direct drop trajectory measurements. This is the paper where the Dunn 2025 CF-9 entry originates — it's the most physically rigorous source currently available for gauge collection efficiency.
Nešpor & Sevruk (1999) — First Numerical Simulation of Gauge Wind Error
J. Atmospheric and Oceanic Technology, 16(4), 450–464. doi:10.1175/1520-0426(1999)016<0450> · ETH Zurich
The paper that started the numerical simulation approach to gauge undercatch. Everything in the Cauteruccio-Colli CFD chain cites this as the foundational method. The k-ε turbulence model and separate computation of airflow then particle trajectories remain the standard approach 26 years later.
What It Actually Did
Three operational precipitation gauges were modeled: a Hellmann-type (cylindrical), a UK standard gauge (Mk2 shape), and a smaller cylindrical gauge. Wind speeds 1–12 m/s. The airflow was computed using 3D k-ε RANS, then validated against 2D constant-temperature anemometer measurements in a wind tunnel. Drop trajectories were computed separately for monodisperse drops at each diameter, then integrated over a gamma drop size distribution — the standard DSD parameterization of the era.
Key Equations
Main Findings
- Wind error increases with decreasing rainfall rate — lighter rain has smaller drops, which deflect more
- Wind error increases with wind speed — the expected result, but now quantified per gauge geometry
- Wind error increases with fraction of small drops in the DSD — this is the DSD-dependence that all subsequent work grapples with
- Gauge comparison reveals real differences — the Mk2 shape performed differently from the cylinder despite similar orifice size
- Computed errors compared favorably to field measurements — validation against Sevruk (1989) field data
Limitation That Shaped 25 Years of Follow-On Work
The computational mesh was coarse by modern standards — limited by 1990s computing capacity. RANS k-ε underestimates turbulence above the orifice rim (as Colli 2016 later proved with LES). Nevertheless, the fundamental structure — CFD airflow → particle trajectory → CE — remained unchanged through Cauteruccio 2024. The paper explicitly stated results should be verified by measurements, which led directly to the Cauteruccio 2021 wind tunnel validation 22 years later.
Colli et al. (2018) — CFD Assessment of Aerodynamic Rain Gauge Performance
Water Resources Research, 54, 779–796. doi:10.1002/2017WR020549 · Open Access · Univ. Genova / Newcastle
The paper that put CFD numbers on gauge shape comparisons for liquid precipitation. Bridges the gap between Pollock's field observations and the theoretical CFD chain — Pollock showed the aerodynamic gauge works better in the field; Colli 2018 showed exactly why at the fluid dynamics level.
Four Gauge Geometries Compared
| Shape | Description | Aerodynamic Behaviour | Key Flow Feature |
|---|---|---|---|
| Conventional cylinder | Standard TBRG body, vertical walls | Worst — large separation zone | Flow separates sharply at rim edge; large turbulent wake extends over orifice |
| Chimney shape | Tall narrow cylinder | Poor — extended wake | High aspect ratio maximises updraft length above orifice |
| Inverted conical A | Flared body (calix type) | Best — recirculating flow | Inverted conical body generates recirculating eddies that partially direct drops inward |
| Inverted conical B | Different taper angle | Good — similar to A | Same recirculation mechanism with slightly different spatial extent |
The Recirculating Flow Discovery
The key aerodynamic finding is not just that the inverted conical shape reduces flow acceleration above the orifice — it actually creates recirculating flow structures near the orifice rim that partially counteract the deflecting effect. This is a qualitatively different mechanism than simply "less turbulence = better catch." The recirculating eddies briefly redirect drops that would otherwise miss the collector back toward it. This mechanism was not predicted by pre-CFD theory and explains why the performance improvement from aerodynamic gauges exceeds what simple wind-speed reduction would suggest.
RANS vs. LES Consistency
Time-averaged RANS was used (consistent with prior work), producing results consistent with Colli 2016's Part I RANS solutions. The paper notes that LES would resolve additional turbulence but is computationally prohibitive for the parameter sweep needed here. The RANS-LES comparison from Colli 2016 established that RANS provides reliable mean-field CE even if it underestimates turbulence fluctuations — sufficient for comparing shape performance.
Colli et al. (2016) Parts I & II — Shielded and Unshielded CE: RANS and LES
Journal of Hydrometeorology, 17, 231–243 (Part I) and 245–255 (Part II). doi:10.1175/JHM-D-15-0010.1 / JHM-D-15-0011.1 · AMS
Two companion papers studying the Geonor T-200B inside a single Alter shield — the configuration used in WMO-SPICE and in most automated network solid precipitation measurements. Part I establishes the airflow; Part II uses it for snowflake trajectory simulations. Together they are the first study to apply time-dependent LES to precipitation gauge aerodynamics and the first to reveal how badly RANS misses turbulence above the orifice rim.
Part I: RANS vs. LES — The Turbulence Revelation
Wind speeds 1–8 m/s were simulated. Both RANS (k-ω shear stress tensor) and LES (Smagorinsky subgrid) were run on the same geometry. Key findings:
- RANS confirms the Alter shield attenuates wind velocity above the gauge — this is expected and validates shield design
- However, RANS systematically underestimates turbulent kinetic energy above the orifice rim compared to LES
- LES reveals turbulent structures that propagate from the Alter blade edges and cross the orifice plane — an effect invisible in time-averaged RANS
- The intensity and spatial extent of the LES-resolved turbulent region depends on wind speed in a nonlinear way that RANS cannot capture
Part II: Snowflake Collection Efficiency
A Lagrangian trajectory model tracks wet snow and dry snow through the airflow fields from Part I. Key findings:
- LES-derived CE is consistently lower than RANS-derived CE — meaning RANS overestimates how well the gauge catches snow
- The difference between LES and RANS CE grows with wind speed — at higher winds, the turbulence effects (which RANS misses) become more important
- The single Alter shield is validated as effective — unshielded CE is substantially lower than shielded CE
- However, the Alter blades themselves generate turbulence that degrades CE relative to an idealized "perfectly smooth wind reduction" shield — the shield helps but also introduces new turbulence
- Wet snow (denser, faster falling) has higher CE than dry snow (lighter crystals, more easily deflected)
The WMO-SPICE transfer functions (Kochendorfer et al. 2017) are based on field measurements, so they implicitly include real turbulence effects. The CFD-based CE curves from RANS models, however, are likely optimistic — they would predict better performance than actually occurs. Any RANS-derived CF should be treated as a lower bound on correction needed.
Cauteruccio et al. (2021) — Wind Tunnel Validation of Particle Tracking
Water Resources Research, 57, e2020WR028766. doi:10.1029/2020WR028766 · Open Access · Univ. Genova / Politecnico di Milano
The paper that closes the loop between theory and experiment. Physical water drops were released in a wind tunnel at controlled speeds and imaged with a high-speed camera. The observed drop trajectories were compared against CFD + Lagrangian particle tracking predictions. This is the experimental proof that the numerical method accurately captures real physics — without which the entire Cauteruccio 2024 CE database would be physically unvalidated model output.
Experimental Setup
Full-scale gauge models placed in a wind tunnel at the Politecnico di Milano. Water drops released at known sizes and speeds. High-speed camera captured individual drop trajectories as they approached the gauge collector. Two gauge geometries tested — one with conventional rim, one with modified rim. Particle Image Velocimetry (PIV) measured the airflow field around the gauge. The CFD simulation was run with identical boundary conditions and the simulated drop trajectories were compared point-by-point against the photographed trajectories.
Key Result
The Lagrangian Particle Tracking model closely reproduced the observed drop trajectories for both gauge geometries tested. Individual drops were directly observed falling outside the collector when wind was present — exactly as the bluff-body theory predicts. The experiment provided the first direct visual evidence of the wind-induced undercatch mechanism: drops can be tracked frame-by-frame approaching the gauge, beginning their deflection in the acceleration zone above the orifice, and landing outside the collector. The CFD simulation matched these trajectories with good fidelity, validating its use for the full 6-gauge comparison in Cauteruccio 2024.
The Turbulence Attenuation Finding (From 2020 Companion Paper)
A companion 2020 paper established that free-stream turbulence (turbulence already present in the incoming wind) actually attenuates the updraft above the collector relative to laminar flow at the same mean wind speed. This means wind tunnel experiments in laminar flow slightly overestimate undercatch compared to real atmospheric conditions. The effect is larger for small drops than large drops — small drops' trajectories are sensitive to local velocity fluctuations that partially average out in a turbulent wind field.
Kochendorfer et al. (2020) — Undercatch Adjustments for Heated TB Gauges
Journal of Hydrometeorology, 21(6), 1193–1205. doi:10.1175/JHM-D-19-0256.1 · Open Access · NOAA/ARL
The WMO-SPICE extension for heated tipping-bucket gauges — directly relevant to ASOS networks including every Pennsylvania ASOS station. While the 2017 Kochendorfer papers covered weighing gauges, this paper covers the heated TB gauges that are actually deployed at NWS/ASOS sites.
Why Heated TB Gauges Are Different
Heated TB gauges melt solid precipitation before measurement — which means they can record snowfall amounts, but the melting process introduces delays (snow must melt before tipping) and the heated funnel creates an upward heat flux that can actually improve catch slightly by creating a thermal updraft that helps funnel snow toward the orifice, or degrade it by melting snow that then evaporates on the heated funnel surface. The net effect is gauge-specific and was not well characterized before SPICE.
Key Findings
| Finding | Detail | Operational Significance |
|---|---|---|
| 6 TB gauge types at 5 sites | Most gauge types tested at multiple sites — cross-site consistency assessed | Transfer functions are more portable than single-site derivations |
| New optimization criterion | Transfer functions minimise sum of errors over multiseasonal accumulation rather than instantaneous CE | Best long-term records, even if individual events less accurate — appropriate for climatological applications |
| Multigauge function outperforms gauge-specific | Function derived from 6 gauge types × 5 sites performs better than individual gauge CFs | Can potentially apply to unshielded heated TB gauges not evaluated in SPICE |
| Solid precipitation timing | Heated TB cannot accurately determine when snowfall occurred — only total amount | Event timing from heated TB records is unreliable; use SWE totals, not event timestamps |
Recommended Transfer Functions (Table 3 of paper)
Pennsylvania ASOS Application
The FP-5 and TE525 heated TB gauges used at Pennsylvania ASOS sites (MDT, CXY, LNS, ABE, IPT, etc.) are the gauge class addressed by this paper. For any analysis using ASOS 1-minute precipitation data during snow events, this transfer function — not the WMO 1998 NWS 8-inch equations — is the appropriate correction. The paper's recommendation to use long-term accumulations (seasonal totals) rather than event-by-event correction aligns with how the Adam & Lettenmaier climatological approach is applied.
Duchon & Biddle (2010) — Undercatch of TBRGs in High Rain Rate Events
Advances in Geosciences, 25, 11–15. CC BY 3.0 · University of Oklahoma
Short, focused, and exceptionally practical. A Geonor T-200B weighing gauge and two MetOne tipping-bucket gauges (one in a pit, one at 1m with an Alter shield) were compared during seven high rain rate events in Norman, Oklahoma. The paper established two thresholds that have been cited in virtually every TBRG undercatch study since.
The Two Thresholds
Experimental Design
Three gauge arrangement was ideal for separating mechanical from wind-induced errors:
- Geonor in pit (WP): Reference — no wind undercatch, no mechanical bucket errors. Ground level = no aerodynamic disturbance.
- MetOne in pit (TP): Removes wind undercatch — any difference from Geonor is mechanical bucket timing error only.
- MetOne at 1m with Alter shield (TN): Has both mechanical and wind undercatch — difference from TP is wind contribution only.
Key Time Series Result
During events with sustained 1-min rain rates above 50 mm/h, the TBRGs fell progressively further behind the weighing gauge. Even the pit TBRG (TP) — with zero wind undercatch — undercatched significantly at high rates. The mechanical undercatch grows roughly quadratically with rain rate above 50 mm/h. For a 100 mm/h event sustained over 30 minutes, the mechanical undercatch alone can approach 20–30% of total event precipitation.
Wind Onset Finding
At the 1m Alter-shielded gauge (TN), systematic divergence from the pit gauges began when 2m wind speed exceeded approximately 5 m/s. Below this threshold, TN tracked TP closely. Above it, TN read progressively lower. This is consistent with Pollock 2018's finding that even minor height increase matters — at 1m with an Alter shield in low wind, performance is acceptable; in moderate wind it degrades rapidly.
Central Pennsylvania thunderstorms routinely produce 1-min rain rates exceeding 50 mm/h. The FP-5 heated TBRG at MDT and CXY — the same MetOne-class heated bucket design studied here — will exhibit this mechanical undercatch during every significant summer thunderstorm. If you compare ASOS hourly totals to CoCoRaHS daily accumulations for summer storms and wonder why the ASOS reads lower, mechanical high-rate undercatch is often the dominant explanation, not wind undercatch.
Ciach (2003) — Local Random Errors in Tipping-Bucket Rain Gauge Measurements
J. Atmospheric and Oceanic Technology, 20, 752–759. AMS · University of Iowa
15 co-located PicoNet TBRGs deployed over an 8m × 8m area in Oklahoma — the data sample used here is 138mm of rainfall from September–December 1999. This paper is the source of the two-category error framework (systematic vs. random) that organizes the Dunn 2025 paper and most subsequent TBRG literature.
What Are Local Random Errors?
Even after removing all systematic biases (wind undercatch, wetting loss, calibration offsets), identical gauges separated by meters still disagree. Ciach calls these "local random errors" — they arise from: time-sampling effects caused by the discrete bucket-tip character of TB measurements; hydrodynamic instabilities in gauge funnels causing variable routing of water to buckets; micro-turbulence differences between co-located gauges. These errors are distinct from the area-point representativeness problem.
Key Quantitative Results
| Condition | Error Magnitude | Notes |
|---|---|---|
| 1-min accumulations, low rate | Very large — can be 100%+ of true value | At low rain rates, the quantization (tip resolution) dominates; most minutes record 0 when truth is 0.05mm |
| 1-min accumulations, high rate | ~20–40% | Even at 20mm/h, adjacent gauges can differ by this amount in any single minute |
| 5-min accumulations | ~10–20% | Improves substantially with averaging time |
| Hourly accumulations | ~5–10% | Most operational applications use hourly — this is the practical floor |
| Daily accumulations | ~2–5% | Daily CoCoRaHS data benefits from this averaging |
Two Processing Strategies Compared
Linear interpolation between tip times vs. tip-counting in fixed intervals — two different ways of computing 1-minute rain rates from the same tip records. The paper demonstrates the errors from each strategy differ significantly at short timescales, and which is better depends on rain rate. This is the authoritative source for why "1-minute TBRG rain rates" should not be treated as precise measurements of actual 1-minute precipitation.
Practical Takeaway for PA Work
For any station comparison study (MDT vs. CXY, ASOS vs. CoCoRaHS), random TBRG errors contribute genuine uncertainty even after all systematic corrections. At the daily accumulation level, the residual random error is 2–5%, which must be considered when interpreting differences smaller than this — including the 1% level corrections that distinguish Legates & Willmott from GPCC methods.
Sieck et al. (2007) — Challenges in Obtaining Reliable Measurements of Point Rainfall
Water Resources Research, 43, W01420. doi:10.1029/2005WR004519 · Univ. of Washington / Princeton
A comprehensive field-data study using the well-instrumented 21.4 km² Goodwin Creek watershed in northern Mississippi. Addresses data quality control, gauge calibration, out-of-level orifices, and wind effects — then critically evaluates whether sophisticated DSD-based wind corrections outperform simple rate-based methods. The answer is no, and understanding why is important.
The Surprising Finding: Complex Correction Not Better Than Simple
The paper evaluates Nešpor & Sevruk's (1999) DSD-based correction technique against simple rainfall rate and wind speed methods. Conclusion: "the sophisticated wind effect correction technique that makes use of raindrop size and wind information is much less effective than traditional methods based on rainfall rate and wind observations alone." The reason is that uncertainties in measuring DSD, wind at gauge rim height, and rainfall simultaneously are large enough to overwhelm the theoretical advantages of the physically more correct method. The inputs required for the sophisticated method are themselves unreliable enough to degrade rather than improve the correction.
Other Key Findings
- Out-of-level orifices: Even a 1–2° tilt creates systematic directional bias — more rain caught from the tilted-toward direction. Instrumentation maintenance is a first-order requirement before any correction makes sense.
- Calibration drift: Over a 2-year period at Goodwin Creek, several gauges developed measurable calibration drift — dynamic calibration recommended at least annually.
- Quality control limits: Automated QC flags can miss physically plausible but erroneous values; human review remains valuable for identifying out-of-level gauges and drift.
- Wind measurement at gauge rim: The key difficulty — wind at rim height is almost never measured; standard 10m anemometer + log profile scaling introduces its own uncertainty that the Nešpor method then amplifies.
Connection to Adam & Lettenmaier
Sieck 2007 validates the Adam & Lettenmaier choice to use simple rate-based and wind-speed-based corrections rather than DSD-based methods. The WMO equations (Table 2-1) require only wind speed, temperature, and precipitation type — not DSD measurements. The Sieck finding provides field-based justification for this pragmatic choice: adding DSD complexity doesn't improve accuracy if DSD measurements are themselves uncertain.
Segovia-Cardozo et al. (2023) — TBRG Measurement Uncertainties, Calibration, and Error Reduction
Sensors, 23(12), 5385. doi:10.3390/s23125385 · Open Access CC BY · Universidad Politécnica de Madrid
The most comprehensive recent review of TBRG uncertainties from a hydrological application perspective. Dunn et al. (2025) describes this as "problem-based" rather than solution-oriented — it is the counterpart to Dunn, exhaustively cataloguing what goes wrong rather than what to do about it. Includes a third error category — management — beyond the standard instrumental and environmental split.
Three Error Categories (Expanded from Ciach)
| Category | Sources | Typical Magnitude | Most Neglected Aspect |
|---|---|---|---|
| Instrumental | Mechanical undercatch at high rates; calibration drift; bucket hysteresis; clogging | 2–10% liquid; can be >50% for intense events | Dynamic calibration — gauges typically calibrated at fixed rate, not across range |
| Environmental | Wind undercatch; splash; evaporation; snow accumulation on funnel | 10–50%+ solid; 2–10% liquid | Wind speed at gauge rim height — almost never directly measured |
| Management | Maintenance frequency; vegetation growth near gauge; wildlife interference; network operator training | Variable but often >5% | Grass growing over gauge rim; spider webs blocking orifice; delayed emptying after storms |
Wind Undercatch Summary from Paper
The paper synthesizes the literature to confirm: wind biases cause approximately 2–10% undercatch for liquid precipitation and 10–50% for solid precipitation, depending on gauge geometry, wind speed, precipitation type, particle distribution, size, and intensity. These ranges are consistent with Adam & Lettenmaier (2003) but now grounded in 20 additional years of literature.
Why Calibration Methods Aren't Being Applied
A critical finding: despite decades of correction methodology development, "calibration methodologies are not frequently implemented by monitoring networks' operators or data users, propagating bias in databases." The main reason identified is lack of knowledge — network operators often don't know correction methods exist, and data users don't know to ask for them. This is the knowledge gap that references like the HTML document this section is embedded in are designed to address.
Emerging Technologies Section
The paper discusses cellular microwave backhaul links (CML) as an emerging supplement to sparse gauge networks — consistent with Dunkerley's (2023) similar recommendation. CML networks cover urban areas densely and are already operational globally; exploiting them for rainfall estimation requires only data-sharing agreements, not new hardware.
Disdrometer Comparison Papers (Three Studies)
Angulo-Martínez et al. (2018) — Parsivel2 vs. Thies LPM, 2 Years, Spain
Hydrology and Earth System Sciences, 22, 2811–2837. doi:10.5194/hess-22-2811-2018 · Open Access · EEAD-CSIC Zaragoza
Two Thies LPM and two OTT Parsivel2 disdrometers operated at Zaragoza, Spain for 2 years. 100,000 minutes of data, 30,000 minutes with rain, intensities up to 277 mm/h. The most comprehensive cross-instrument disdrometer comparison in this library.
| Variable | Parsivel2 vs. Thies LPM | Key Driver of Difference |
|---|---|---|
| Number of particles detected | Thies records more particles — especially small drops | Differing detection thresholds for D < 0.5mm |
| Drop size distribution | Significant differences — Thies shifts DSD toward smaller drops | Different binning schemes and internal corrections |
| Rain intensity R | Thies higher than Parsivel2 for same event | Small drop emphasis inflates R (small drops contribute less to volume per drop) |
| Radar reflectivity Z | Large differences — up to 6 dBZ for same event | Z is proportional to D⁶ so DSD differences are amplified enormously |
| Kinetic energy KE | Differences grow with intensity | DSD × velocity distribution both differ; compounds at high rate |
If two major commercial disdrometers disagree on Z by up to 6 dBZ for the same event, then Z-R relationships calibrated with one instrument cannot be directly applied to radar QPE without re-calibration when using data from the other. This is not a minor numerical detail — 6 dBZ corresponds to a factor of 4× difference in rain rate via a standard Z-R relationship. Any Z-R validation or calibration work must specify which disdrometer was used.
Johannsen et al. (2020) — Three Laser Disdrometers + OTT Pluvio2, Austria
Hydrological Sciences Journal, 65(4), 524–535. doi:10.1080/02626667.2019.1709641 · Taylor & Francis
PWS100 (Campbell Scientific) × 2, Thies LPM × 1, and first-generation OTT Parsivel × 1, co-located with an OTT Pluvio2 weighing gauge at Petzenkirchen, Austria. Focused on kinetic energy estimation for soil erosion applications — but the rainfall total biases are the most operationally important finding.
| Instrument | Bias vs. Pluvio2 (rainfall total) | KE correction factor needed |
|---|---|---|
| PWS100 (Campbell) | −2% to −5% | 1.15 |
| Thies LPM | −8% to −16% | 1.28 |
| OTT Parsivel (1st gen) | −20% to −29% | 1.36 |
All disdrometers underestimated total rainfall vs. the weighing gauge. The correction factors for kinetic energy (1.15 to 1.36) allow cross-instrument comparison but cannot be treated as universal — they were derived at a specific site in a specific climate.
Angeloni et al. (2024) — Thies 3D Stereo Disdrometer vs. LPM, Italy
Sensors, 24, 1562. doi:10.3390/s24051562 · Open Access CC BY · CNR-ISAC Rome
First scientific evaluation of the Thies 3D Stereo (3DS) imaging disdrometer — a very new instrument that captures 3D images of hydrometeors. Compared against the Thies LPM at L'Aquila, Italy during CORE-LAQ campaign covering both rain and snow events.
- Rain/snow classification: Excellent agreement between 3DS and LPM — both correctly identify rain and snow events
- Mixed phase and hail: Larger disagreements — the added imaging capability of 3DS is most valuable here
- Small particles (<1mm): 3DS detects more — larger measurement volume catches small drops that miss the narrower LPM laser sheet
- Terminal velocity for large drops (>3mm): Both instruments underestimate — consistent with known issue in all laser disdrometers at large drop sizes
- Total cumulative precipitation: Good agreement per event despite particle-level differences — the biases partially cancel in integration
- Image resolution: The raw 3DS images are useful for qualitative hydrometeor shape classification but not yet fine-resolution enough for precise shape analysis
Cai et al. (2025) — Wind Disturbance on Rainfall Collection Rate: CFD + Field
J. Physics: Conference Series, 2964, 012007. doi:10.1088/1742-6596/2964/1/012007 · Open Access · Nanjing Hydraulic Research Institute
The most recent paper in the library (2025) and the only one from a Chinese research institution. Field experiments at three heights (0m, 0.7m, 4.3m) combined with CFD simulation at the same site. Provides both empirical validation data and physical mechanism explanation for the height effect, plus a practical correction formula.
The Three-Height Field Experiment
| Height | Mean Collection Rate | Notes |
|---|---|---|
| 0 m (ground level) | Reference — 100% | Pit gauge equivalent — no aerodynamic disturbance |
| 0.7 m | ~95% of ground gauge | Only 5% loss — close to ground, limited wind disturbance |
| 4.3 m | Substantially lower | Order: 4.3m < 0.7m < 0m (ground gauge) |
This field result directly confirms Pollock 2018 — mounting height matters enormously. A gauge at 0.7m loses only ~5% while one at 4.3m loses substantially more. The exponential increase in undercatch with height is consistent with the logarithmic wind profile's implication that wind speed grows rapidly with height in the lowest meters of the atmosphere.
The CFD Mechanism Discovery
CFD simulation revealed that a high-gradient wind velocity distortion region forms within approximately 5 cm above the gauge collector orifice. Within this 5cm zone, wind velocity increases by approximately 19% relative to the undisturbed flow. Below this zone is a vortex "raceway" — a recirculating flow structure at the orifice rim. These two features together create uneven forces on precipitation particles in the critical region where drops must enter the collector. The 19% velocity increase in 5cm directly explains why reducing gauge height by even 0.5m can produce measurable improvement — you move the orifice further from the elevated wind gradient zone aloft.
The Correction Formula
Significance for US Network
The Chinese standard gauge is 0.7m — consistent with the Cai result showing ~5% undercatch at that height. The NWS 8-inch is at 1.1m. The linear interpolation between 0.7m (~5% loss) and 4.3m (substantially more) suggests the NWS 8-inch at 1.1m lies in a wind-exposure regime meaningfully worse than the optimal 0.7m — consistent with the WMO equations predicting greater undercatch at 1.1m than at 0.7m after log-profile scaling.