- a rolling look back period of n bars, similar to a moving average
- selecting non consecutive periods of price history with similarity to the most recent history
As can be seen there is nice separation between the market types and the SVM achieves over 98% cross-validation accuracy on this training set. Despite this, when applied to real market data I am yet again disappointed by the performance and choose for now to no longer pursue this avenue of investigation.
In addition to all the above, I have "discovered" the Octave sourceforge nan package, which I may begin to investigate more fully in due course. I have also been working through another Massive Open Online Course, this time Statistical Learning, which is in its last week at the moment. In this last week of the course I have been alerted to a possible new area of investigation, Distance correlation, which I had heard of before but not fully appreciated.
Finally, I have also been reassessing the code I use for calculating dominant cycle periods. It is these last two, distance correlation and the period code, that I'm going to look at more fully over the coming days.
Is the Statistical Learning course good? I have done Andrew Ng's ML one (though I skipped the exercises, probably to my detriment), and I'm currently working through a Scala one.
ReplyDeleteExperquisite,
ReplyDeleteThe course is at a basic, introductory level and for those areas common to both this course and Andrew Ng's ML course you probably won't learn anything new. However, there is some new stuff, namely Smoothing Splines, Decision Trees, Bootstrapping and Monte Carlo, Linear Discriminant Analysis and Hierarchical Clustering. I also found it useful that all coding is in R rather than Octave.