Theoretical and Natural Science
- The Open Access Proceedings Series for Conferences
Vol. 30, 24 January 2024
* Author to whom correspondence should be addressed.
Dynamic models have been widely cited in predicting criminal population, residential electricity consumption, food prices and other objects. However, for total population predictions, dynamic models are rarely used. In this study, we will analyse the relationship between 13 variables such as CPI, grain prices, and medical expenditures and the total population of the United States, then combine it with the ARIMA model to generate a time series dynamic regression model. The conclusion is that, according to the parameters of the final model, two predictors (CPI and the number of crimes) and one interaction term (the product of the poverty rate and unemployment rate) are significantly related to changes in the population. Ultimately, the model performed well on the test set and was remarkably accurate for population prediction five years later. This report screens various factors influencing the total population and provides a broader background for applying dynamic models. In addition, this study also provides directions for subsequent research on more efficient dynamic models.
Population forecasts; dynamic regression models; time series
1. Cesario E, Catlett C and Talia D 2016 Forecasting crimes using autoregressive models. In 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 795-802.
2. Chen P, Yuan H and Shu X 2008 Forecasting crime using the arima model. In 2008 fifth international conference on fuzzy systems and knowledge discovery, 5, 627-630.
3. Shaqiri F, Korn R and Truong H P 2023 Dynamic Regression Prediction Models for Customer Specific Electricity Consumption. Electricity, 4(2), 185-215.
4. Box G E, Jenkins G M, Reinsel G C and Ljung G M 2015 Time series analysis: forecasting and control. John Wiley & Sons.
5. Wang G, et al. 2021 Comparison of ARIMA, ES, GRNN and ARIMA–GRNN hybrid models to forecast the second wave of COVID-19 in India and the United States. Epidemiology & Infection, 149, 240.
6. Martin P 2022 Linear regression: An introduction to statistical models. Sage.
7. Shumway R H, Stoffer D S and Stoffer D S 2000 Time series analysis and its applications. New York: springer.
8. Akaike H 1974 A new look at the statistical model identification. IEEE transactions on automatic control, 19(6), 716-723.
9. Ljung G M and Box G E 1978 On a measure of lack of fit in time series models. Biometrika, 65(2), 297-303.
10. Hyndman R J and Athanasopoulos G 2018 Forecasting: principles and practice. OTexts.
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open Access Instruction).