科研成果 by Year: 2020

2020
Hao C, Li F, von Rosen D. A Bilinear Reduced Rank Model. In: Fan J, Pan J Contemporary Experimental Design, Multivariate Analysis and Data Mining. Springer Nature; 2020. 访问链接Abstract
This article considers a bilinear model that includes two different latent effects. The first effect has a direct influence on the response variable, whereas the second latent effect is assumed to first influence other latent variables, which in turn affect the response variable. In this article, latent variables are modelled via rank restrictions on unknown mean parameters and the models which are used are often referred to as reduced rank regression models. This article presents a likelihood-based approach that results in explicit estimators. In our model, the latent variables act as covariates that we know exist, but their direct influence is unknown and will therefore not be considered in detail. One example is if we observe hundreds of weather variables, but we cannot say which or how these variables affect plant growth.
Kalesan B, Zhao S, Poulson M, Neufeld M, Dechert T, Siracuse JJ, Zuo Y, Li F. Intersections of Firearm Suicide, Drug-Related Mortality, and Economic Dependency in Rural America. Journal of Surgical Research. 2020;256:96–102.
Li X, Kang Y, Li F. Forecasting with Time Series Imaging. Expert Systems with Applications [Internet]. 2020;160:113680. 访问链接Abstract
Feature-based time series representations have attracted substantial attention in a wide range of time series analysis methods. Recently, the use of time series features for forecast model averaging has been an emerging research focus in the forecasting community. Nonetheless, most of the existing approaches depend on the manual choice of an appropriate set of features. Exploiting machine learning methods to extract features from time series automatically becomes crucial in state-of-the-art time series analysis. In this paper, we introduce an automated approach to extract time series features based on time series imaging. We first transform time series into recurrence plots, from which local features can be extracted using computer vision algorithms. The extracted features are used for forecast model averaging. Our experiments show that forecasting based on automatically extracted features, with less human intervention and a more comprehensive view of the raw time series data, yields highly comparable performances with the best methods in the largest forecasting competition dataset (M4) and outperforms the top methods in the Tourism forecasting competition dataset.
Kang Y, Hyndman RJ, Li F. GRATIS: GeneRAting TIme Series with Diverse and Controllable Characteristics. Statistical Analysis and Data Mining: The ASA Data Science Journal [Internet]. 2020;13:354–376. 访问链接Abstract
The explosion of time series data in recent years has brought a flourish of new time series analysis methods, for forecasting, clustering, classification and other tasks. The evaluation of these new methods requires either collecting or simulating a diverse set of time series benchmarking data to enable reliable comparisons against alternative approaches. We propose GeneRAting TIme Series with diverse and controllable characteristics, named GRATIS, with the use of mixture autoregressive (MAR) models. We simulate sets of time series using MAR models and investigate the diversity and coverage of the generated time series in a time series feature space. By tuning the parameters of the MAR models, GRATIS is also able to efficiently generate new time series with controllable features. In general, as a costless surrogate to the traditional data collection approach, GRATIS can be used as an evaluation tool for tasks such as time series forecasting and classification. We illustrate the usefulness of our time series generation process through a time series forecasting application.