Error-Bounded Approximation of Data Stream: Methods and Theories

Xie, Q, Pang, C, Zhou, X, Zhang, X and Deng, K 2018, 'Error-Bounded Approximation of Data Stream: Methods and Theories' in Sayed-Mouchaweh M (ed.) Learning from Data Streams in Evolving Environments, Springer, Cham, Switzerland, pp. 93-122.


Document type: Book Chapter
Collection: Book Chapters

Title Error-Bounded Approximation of Data Stream: Methods and Theories
Author(s) Xie, Q
Pang, C
Zhou, X
Zhang, X
Deng, K
Year 2018
Title of book Learning from Data Streams in Evolving Environments
Publisher Springer
Place of publication Cham, Switzerland
Editor(s) Sayed-Mouchaweh M
Start page 93
End page 122
Subjects Information Engineering and Theory
Summary Since the development of sensor network and Internet of Things, the volume of data is rapidly increasing and the streaming data has attracted much attention recently. To efficiently process and explore data streams, the compact data representation is playing an important role, since the data approximations other than the original data items are usually applied in many stream mining tasks, such as clustering, classification, and correlation analysis. In this chapter, we focus on the maximum error-bounded approximation of data stream, which represents the streaming data with constrained approximation error on each data point. There are two criteria for the approximation solution: self-adaption over time for varied error bound and real-time processing. We reviewed the existing data approximation techniques and summarized some essential theories such as optimization guarantee. Two optimal linear-time algorithms are introduced to construct error-bounded piecewise linear representation for data stream. One generates the line segments by data convex analysis, and the other one is based on the transformed space, which can be extended to a general model. We theoretically analyzed and compared these two different spaces, and proved the theoretical equivalence between them, as well as the two algorithms.
Copyright notice © Springer International Publishing AG, part of Springer Nature 2019
DOI - identifier 10.1007/978-3-319-89803-2_5
ISBN 9783319898025
Versions
Version Filter Type
Altmetric details:
Access Statistics: 35 Abstract Views  -  Detailed Statistics
Created: Thu, 06 Dec 2018, 10:39:00 EST by Catalyst Administrator
© 2014 RMIT Research Repository • Powered by Fez SoftwareContact us