Abstract
In this demo, we present the StreamFitter system for real-time linear regression analysis on continuous data streams. In order to perform regression on data streams, it is necessary to continuously update the regression model while receiving new data. In this demo, we will present two approaches for on-line, multi-dimensional linear regression analysis of stream data, namely Incremental Mathematical Stream Regression (IMSR) and Approximate Stream Regression (ASR). These methods dynamically recompute the regression model, considering not only the data records of the current window, but also the synopsis of the previous data. Therefore, the refined parameters more accurately model the entire data stream. The demo will show that the proposed methods are not only efficient in time and space, but also generate better fitted regression functions compared to the traditional sliding window methods and well-adapted to data changes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and Issues in Data Stream Systems. In: PODS (March 2002)
Brown, R.G., Meyer, R.F.: The fundamental theorem of exponential smoothing. Operations Research 9(5), 673–685 (1961)
Keogh, E., Chu, S., Hart, D., Pazzani, M.: Segmenting Time Series: A Survey And Novel Approach. In: Data Mining in Time Series Databases. World Scientific Publishing Company, Singapore (2004)
Berk, R.A.: Regression Analysis: A Constructive Critique. Sage Publications, Thousand Oaks (2004)
Tokyo Stock Exchange, http://www.tse.or.jp/english/market/topix/data/index.html
Tropical Atmosphere Ocean  project, http://www.pmel.noaa.gov/tao/index.shtml
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nadungodage, C.H., Xia, Y., Li, F., Lee, J.J., Ge, J. (2011). StreamFitter: A Real Time Linear Regression Analysis System for Continuous Data Streams. In: Yu, J.X., Kim, M.H., Unland, R. (eds) Database Systems for Advanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20152-3_39
Download citation
DOI: https://doi.org/10.1007/978-3-642-20152-3_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20151-6
Online ISBN: 978-3-642-20152-3
eBook Packages: Computer ScienceComputer Science (R0)