Machine Learning Imputation of High Frequency Price Surveys

Machine Learning Imputation of High Frequency Price Surveys in Papua New Guinea

Capabilities to track fast-moving
economic developments re-main limited in many regions of the
developing world. This complicates prioritizing policies
aimed at supporting vulnerable populations. To gain insight
into the evolution of fluid events in a data scarce context,
this paper explores the ability of recent machine-learning
advances to produce continuous data in near-real-time by
imputing multiple entries in ongoing surveys. The paper
attempts to track inflation in fresh produce prices at the
local market level in Papua New Guinea, relying only on
incomplete and intermittent survey data. This application is
made challenging by high intra-month price volatility, low
cross-market price correlations, and weak price trends. The
modeling approach uses chained equations to produce an
ensemble prediction for multiple price quotes
simultaneously. The paper runs cross-validation of the
prediction strategy under different designs in terms of
markets, foods, and time periods covered. The results show
that when the survey is well-designed, imputations can
achieve accuracy that is attractive when compared to
costly–and logistically often infeasible–direct measurement.
The methods have wider applicability and could help to fill
crucial data gaps in data scarce regions such as the Pacific
Islands, especially in conjunction with specifically
designed continuous surveys.

Related Keywords

, Global Monitoring Database ,

© 2025 Vimarsana