Abstract
We present the architecture of a “useful pattern” mining system that is capable of detecting thousands of different candlestick sequence patterns at the tick or any higher granularity levels. The system architecture is highly distributed and performs most of its highly compute-intensive aggregation calculations as complex but efficient distributed SQL queries on the relational databases that store the time-series. We present initial results from mining all frequent candlestick sequences with the characteristic property that when they occur then, with an average at least 60% probability, they signal a 2% or higher increase (or, alternatively, decrease) in a chosen property of the stock (e.g. close-value) within a given time-window (e.g. 5 days). Initial results from a first prototype implementation of the architecture show that after training on a large set of stocks, the system is capable of finding a significant number of candlestick sequences whose output signals (measured against an unseen set of stocks) have predictive accuracy which varies between 60% and 95% depended on the type of pattern.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods (ICPRAM 2013) |
Antal sider | 4 |
Vol/bind | 20 |
Forlag | International Conference on Pattern Recognition Applications and Methods |
Publikationsdato | 2013 |
Sider | 608-612 |
ISBN (Trykt) | 978-989856541-9 |
Status | Udgivet - 2013 |
Begivenhed | ICPRAM 2013 - Barcelona, Spanien Varighed: 15 feb. 2013 → 18 feb. 2013 |
Konference
Konference | ICPRAM 2013 |
---|---|
Land/Område | Spanien |
By | Barcelona |
Periode | 15/02/2013 → 18/02/2013 |