Prediction models » Historie » Verze 4
Zuzana Káčereková, 2021-03-29 21:50
1 | 1 | Zuzana Káčereková | h1. Prediction models |
---|---|---|---|
2 | 2 | Zuzana Káčereková | |
3 | After a consultation with a Machine Learning expert, we have arrived at several possibilities regarding prediction models. |
||
4 | |||
5 | 3 | Zuzana Káčereková | First of all, a good analysis of the input data is necessary. A careful selection of the input parameters should be made in a "less is more" manner: selecting too many parameters may lead to unpredictable behavior. |
6 | 1 | Zuzana Káčereková | |
7 | 3 | Zuzana Káčereková | Some suggested methods of examining the data had been: |
8 | |||
9 | * Finding, whether the data is geometrically interpretable |
||
10 | * Inspecting parameter correlation (using Excel functions) |
||
11 | * Projecting to 2D/3D via PCA and inspecting for clusters (using MATLAB) |
||
12 | |||
13 | If clusters were to be found, clustering could be used to determine several classes of expected "rush". This is, apparently, unlikely. If so, simple k-means clustering could be used to determine these classes. |
||
14 | |||
15 | The classifier itself should likely be a Naive Bayes Classifier, with clustering results providing the supervision. |
||
16 | |||
17 | Aggregate JIS card, authorization system, and occupancy data may, alternatively, be used to verify a prediction model, with an additional model of student "fall off" during the academic year, simply based on the date (Look into: Selection model). With this approach, the goal remains to provide a supervisor to the NBClassifier. |
||
18 | |||
19 | Using decision trees had also been suggested. |
||
20 | Good introductory resource: http://www.r2d3.us/visual-intro-to-machine-learning-part-1/ |
||
21 | 4 | Zuzana Káčereková | |
22 | Also see DBSCAN, possible use of occupancy data. |