This document discusses techniques for using data mining to design neural network time series predictors. It describes two categories of prior knowledge extraction: direct and indirect embedding. Direct embedding uses information theory, dynamical systems analysis, and stochastic modeling to provide insight into neural network architecture, parameters, and data sampling. Indirect embedding exploits known target function properties and non-stationarity detection to enlarge data sets and avoid unnecessary model redesign. The document provides examples of applying these techniques to improve time series prediction accuracy and efficiency.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
27 views12 pages
University of Computer Studies, Mandalay (UCSM)
This document discusses techniques for using data mining to design neural network time series predictors. It describes two categories of prior knowledge extraction: direct and indirect embedding. Direct embedding uses information theory, dynamical systems analysis, and stochastic modeling to provide insight into neural network architecture, parameters, and data sampling. Indirect embedding exploits known target function properties and non-stationarity detection to enlarge data sets and avoid unnecessary model redesign. The document provides examples of applying these techniques to improve time series prediction accuracy and efficiency.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 12
University of Computer Studies, Mandalay
(UCSM)
Faculty of Computer Science
Chapter 10 Data Mining Techniques for Designing Neural Network Time Series Predictors A real-life time series prediction system is usually subject to two constraints- accuracy and time, meaning that a sufficiently accurate prediction has to be provided in an imposed time frame. Cont’d Direct knowledge embedding methods, based on information theoretical modeling, dynamical system analysis, and stochastic modeling. It is illustrated that direct methods can produce a wealth of prior information regarding the choice of an appropriate neural network architecture, data sampling rates, as well as starting values for the model parameters, which otherwise have to be found as a result of costly trial-and-error procedures. Cont’d Indirect embedding methods which exploit known properties of the target function and non-stationarity detection techniques. The use of known properties of the target function can enlarge scarce data sets or enforce more accurate learning through constrained optimization. Non-stationarity analysis can considerably improve the computational efficiency of time series forecasting by avoiding the neural network model redesign more often than needed. Introduction The outcomes of a phenomenon over time form a time series. Time series are encountered in sciences as well as in real life. Example of time series: the voltage measured every second across a resistor in an electrical circuit, the number of cars passing a marker on a highway every minute, the yearly power consumption of the United States, the hourly exchange rate of German mark versus U.S. dollar, the daily car production of Chrysler corporation The observation of past outcomes of a phenomenon in order to anticipate its future behavior represents the essence of forecasting (prediction). Cont’d The design of an appropriate neural network for time series prediction problems with high data arrival rates (e.g. Internet traffic predictions or financial intra-day predictions) can be a challenging task, due to time consuming trial-and error architecture selection, non-linear parameter optimization and the need to devise new prediction models whenever the underlying data distribution changes. For these reasons, any prior knowledge that could be extracted from a time series under study can dramatically decrease the design time of a predictor and also improve its prediction accuracy significantly. Cont’d Two categories of prior knowledge extraction techniques, which are embeddable into neural network prediction models. The first category, denoted as direct knowledge embedding encompasses information theory, non-linear dynamics, and stochastic analysis. These techniques of exploratory data analysis can provide prior knowledge regarding appropriate neural network architecture, initial network parameters and adequate data sampling rate. A real-life time series (compressed video traffic data), as well as an artificial, non-linear, chaotic time series (Mackey-Glass data) are used to illustrate the embedding of prior knowledge extracted from stochastic analysis into the neural network design process. Cont’d The second category, indirect knowledge embedding, addressed, includes the use of known properties of target functions and non-stationarity detection. The use of known properties of the target function can enlarge scarce data sets by creating additional artificial training examples, or enforce more accurate learning through constrained optimization. Non-stationarity analysis can considerably improve the computational efficiency of time series forecasting by avoiding the neural network model re-design more often than needed. Direct Information Extraction Procedures Information Theory Dynamical System Analysis Stochastic Analysis Indirect Information Extraction Procedures Knowledge of Properties of the Target Function Non- Stationarity Detection Knowledge of Properties of the Target Function Some properties of the function to be approximated are known ahead of time. The use of these properties, called hints, is of major importance especially in problems with scarce, costly to obtain, or noisy data, like financial forecasting problems in which hints can improve the model’s accuracy dramatically. A non-valid hint can deteriorate the performance of the model considerably, so care must be taken in order to analyze the validity of hints. Cont’d Two modalities for incorporating hints in the neural network learning process are proposed: creating additional “virtual” training examples; . imposing constraints on the learning process by modifying the cost function.