Data sources » History » Version 36

Alex Konig, 2021-05-01 16:01

h1. Data sources

h2. ZČU open data

More detailed data analysis can be found on these pages:

* [[Weather at ZCU]]
* [[Jis activity - graphs]]
* [[Login activity at ZCU - graphs]]

Details about the data file structure are on the page [[Data file structure]].

All ZČU data can be downloaded in XML, CSV and JSON formats.

As discussed in later chapters, some data sources do not provide sufficient data granularity or volume. The datasets may become more suitable in the future, which should at least be acknowledged; this is discussed further in [[Prediction models]]. Throughout the data, standard university tags are used, but for some of them (for example "parkoviště" or "STUD-PRA1") no source explains what they mean, so we had to make assumptions about where they are.

To display correct predictions, we need to process the data so that it is divided by building. The buildings are:

Buildings on campus:

* Fakulta strojní + ekonomická
* Fakulta designu a umění
* Fakulta aplikovaných věd
* Fakulta elektrotechnická
* Rektorát ZČU
* Menza
* Library
* CIV, ZV, UCV, IPC
* Univerzitní 14

Dorms:

* Koleje armabeton
* Koleje Bory
* Koleje Lochotín
* Koleje klatovská

Buildings in the city:

* Dominikánská 9
* Husova 11
* Chodské náměstí 1
* Jungmannova 1, 3
* Klatovská 51
* Kollárova 19
* Riegrova 11, 17
* Sady Pětatřicátníků 14, 16
* Sedláčkova 15, 19, 31, 38-40, Veleslavínova 27-29, 42
* Teslova 5, 9, 9a, 11 - buildings C, F, G, H on the VTP Plzeň premises
* Tylova 59

Data from Cheb is discarded because the buildings are far from the others, which would cause problems in visualization, and because the data is likely to contain too little relevant information.

Classroom prefixes can be divided in the following way:

|_. Building |_. Abbreviation |_. Room prefixes |
| Fakulta strojní + ekonomická | FST+FEK | UV, UU, UK, UL, UP, UF, UH, UD, UX |
| Fakulta designu a umění | FDU | LS |
| Fakulta aplikovaných věd | FAV | UN, UC, US |
| Fakulta elektrotechnická | FEL | EU, EK, EL, EP, ES, ET, EH, EZ |
| Rektorát ZČU | REK | UR |
| Menza | MENZA | - |
| Library | LIB | UB |
| CIV, ZV, UCV, IPC | CIV | UI |
| Univerzitní 14 | UNI14 | UT |
| | | |
| Dominikánská 9 | DOM | DD |
| Husova 11 | HUS | HJ |
| Chodské náměstí 1 | CHOD | CH |
| Jungmannova 1, 3 | JUNG | JJ |
| Klatovská 51 | KLAT | KL |
| Kollárova 19 | KOLL | KO |
| Riegrova 11, 17 | RIEG | RJ, RS |
| Sady Pětatřicátníků 14, 16 | SADY | PC, PS |
| Sedláčkova 15, 19, 31, 38-40, Veleslavínova 27-29, 42 | SED+VEL | SP, SD, ST, SO, VC |
| Teslova 5, 9, 9a, 11 - buildings C, F, G, H on the VTP Plzeň premises | TES | T, TF, TG, TH |
| Tylova 59 | TYL | TY, TS |
| | | |
| Koleje armabeton | KARMA | - |
| Koleje Bory | KBORY | - |
| Koleje Lochotín | KLOCH | - |
| Koleje klatovská | KKLAT | - |

The building and room abbreviations were taken from https://ps.zcu.cz/strediska/budovy-plzen.html
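
The prefix table can be turned into a lookup from room name to building. The following is a minimal sketch, not the project's actual implementation: the function name @building_for_room@ and the sample room names are hypothetical, and it assumes that a room name starts with one of the listed prefixes. Prefixes are tried longest first, because the one-letter prefix "T" (TES) would otherwise also match rooms starting with "TY" or "TS" (TYL).

```python
# Room prefixes per building abbreviation, copied from the table above.
PREFIXES_BY_BUILDING = {
    "FST+FEK": ["UV", "UU", "UK", "UL", "UP", "UF", "UH", "UD", "UX"],
    "FDU": ["LS"],
    "FAV": ["UN", "UC", "US"],
    "FEL": ["EU", "EK", "EL", "EP", "ES", "ET", "EH", "EZ"],
    "REK": ["UR"],
    "LIB": ["UB"],
    "CIV": ["UI"],
    "UNI14": ["UT"],
    "DOM": ["DD"],
    "HUS": ["HJ"],
    "CHOD": ["CH"],
    "JUNG": ["JJ"],
    "KLAT": ["KL"],
    "KOLL": ["KO"],
    "RIEG": ["RJ", "RS"],
    "SADY": ["PC", "PS"],
    "SED+VEL": ["SP", "SD", "ST", "SO", "VC"],
    "TES": ["T", "TF", "TG", "TH"],
    "TYL": ["TY", "TS"],
}

# Flatten to (prefix, building) pairs sorted by prefix length, longest first,
# so that "T" cannot shadow the two-letter prefixes "TY" and "TS".
_PREFIX_PAIRS = sorted(
    (
        (prefix, building)
        for building, prefixes in PREFIXES_BY_BUILDING.items()
        for prefix in prefixes
    ),
    key=lambda pair: len(pair[0]),
    reverse=True,
)


def building_for_room(room_name):
    """Return the building abbreviation for a room name, or None if unknown."""
    normalized = room_name.strip().upper()
    for prefix, building in _PREFIX_PAIRS:
        if normalized.startswith(prefix):
            return building
    return None
```

Buildings without room prefixes (Menza and the dorms) are intentionally left out of the lookup, so rooms that match no prefix return @None@ and can be handled separately.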

h3. Data timescale

Collection of each dataset started on a different date, so the amount of available data differs between datasets.

JIS data has been recorded since 8.4.2018.

Log-ins have been recorded since 20.10.2011.

Weather data has been recorded since 30.4.2019.

Since JIS and log-in data seem to follow the same trends in every recorded year, we decided to use only the period for which weather data is also available, i.e. from 30.4.2019 onward.
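
The common period can be derived as the latest of the three start dates. This is a small sketch under that assumption; the dictionary keys and function names are made up for illustration, only the dates come from the text above.

```python
from datetime import date, datetime

# Recording start dates as stated above, in day.month.year notation.
RECORDING_START = {
    "jis": "8.4.2018",
    "login": "20.10.2011",
    "weather": "30.4.2019",
}


def parse_cz_date(text):
    """Parse a day.month.year string such as '30.4.2019'."""
    return datetime.strptime(text, "%d.%m.%Y").date()


def common_start(starts=RECORDING_START):
    """Return the first day on which every dataset is available."""
    return max(parse_cz_date(value) for value in starts.values())


def in_common_period(day, starts=RECORDING_START):
    """Return True if all datasets already cover the given day."""
    return day >= common_start(starts)
```

Records dated before @common_start()@ would then simply be skipped during processing.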

h3. Historical weather data

Link to data: http://opendata.zcu.cz/Energeticky-dispecink.html

Data contains:

* datum_a_cas - date and time at which the values were measured, with hourly accuracy
* teplota - average temperature in the given time slot (°C)
* vitr - average wind speed in the given time slot (m/s)
* dest - rain indicator: rain (1) or no rain (0)
* svetelnost - average luminance (k lux)

For further processing, luminance will be translated to the terms "sunny", "overcast" and "cloudy". The 2019 data contains values between 0 and 83.2 k lux.

Lux values can be interpreted using the following table:

|_. Conditions |_. Value (lux) |
| Sunlight | 107527 |
| Full Daylight | 10752 |
| Overcast Day | 1075 |
| Very Dark Day | 107 |
| Twilight | 10.8 |
| Deep Twilight | 1.08 |
| Full Moon | 0.108 |
| Quarter Moon | 0.0108 |
| Starlight | 0.0011 |
| Overcast Night | 0.0001 |

Source: https://www.engineeringtoolbox.com/light-level-rooms-d_708.html

However, comparing the values in the data with archived weather forecasts suggests that the following table is more appropriate:

|_. Conditions |_. Value (k lux) |
| Direct sunlight | >60 |
| Sunny | 40-60 |
| Overcast | 20-40 |
| Cloudy | 0-20 |
| Night | 0 |

Used weather archive: https://www.in-pocasi.cz/archiv/archiv.php?historie=2019-12-01&region=9
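
The second table could be applied to the svetelnost values roughly as follows. This is only a sketch: the table does not say to which category the boundary values 20, 40 and 60 belong, so assigning them to the lower category is an assumption, and the function name is hypothetical.

```python
def classify_luminance(k_lux):
    """Map a svetelnost value in k lux to a condition from the table above.

    Boundary values (20, 40, 60) are assigned to the lower category;
    this choice is an assumption, the table leaves it open.
    """
    if k_lux < 0:
        raise ValueError("luminance cannot be negative")
    if k_lux == 0:
        return "night"
    if k_lux <= 20:
        return "cloudy"
    if k_lux <= 40:
        return "overcast"
    if k_lux <= 60:
        return "sunny"
    return "direct sunlight"
```

For example, the maximum value found in the 2019 data, 83.2 k lux, falls into the "direct sunlight" category.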

More detailed data analysis is in [[Weather at ZCU]].

h3. JIS data

Link to data: http://opendata.zcu.cz/Snimace-JIS.html

Data contains:

* datum_a_cas - timestamp of JIS authentication (millisecond accuracy)
* pocet_logu - number of authenticated users at the given time
* popis_objektu - description of the object according to standard ZČU tagging

The linked page states that "... Data about dorms, entry to laboratories and other spaces with restricted access, information about the university canteen, checkouts in the university library, access to copy machines etc. can be interesting for students ...". However, not all of these places can be found in the data. The data from 2019 contains only 46 different places, most of which are dorms, parking lots and buffets.

The number of logged places may increase in the future; however, it is also possible that the data was affected by GDPR and that more detailed data will no longer be published.

A possible solution is to assign the provided places to buildings. More detailed analysis of the data is in [[Jis activity - graphs]].

|_\3. Dorms and gyms |
|_. Dorm |_. Building |_. Location |
| A1, A2-Hlavni vchod, A3, A2 | KARMA | on Borská street |
| B3-LEVY, B3-LevyVytah, B3-PRAVY, B3-PravyVytah, B3 | KBORY | on Baarova street |
| M16, M14 | KBORY | on Máchova street |
| L1, L2, L1L2-vchod | KLOCH | on Bolevecká street |
| L-Posilovna | KLOCH | in Bolevecká dorm |
| KL-Posilovna, K1 | KKLAT | on Klatovská street |

|_\3. Parking lots |
|_. Place |_. Building |_. Notes |
| Zavora-FEL | FEL | |
| Zavora-Kaplirova | - | on Kaplířova street |
| US 005 - závora vjezd, US 005 - mříž vjezd | FAV | |
| Zavora-FDU | FDU | |
| Parkoviste-vjezd, Parkoviste-vyjezd | all on campus | |
| Zavora-NTIS-vjezd, Zavora-NTIS-vyjezd | FAV | |
| VC-VJEZD, VC-VYJEZD | SED+VEL | on Veleslavínova street |
| KolaBory-vnejsi, KolaBory-vnitrni | FST+FEK | |
| EXT/kola | FST+FEK | |
| EXT/kola-B | FAV | |
| B3-kolarna | KBORY | on Baarova street |

|_\3. Food courts |
|_. Name |_. Building |_. Notes |
| EP-BUFET | FEL | |
| NTIS-BUFET | FAV | |
| UV1-Bufet | FST+FEK | |
| MenzaKL-vydej | MENZA | |
| Menza4-kasa{x} | MENZA | x in range <1, 5> |
| Menza1-kasa-l, Menza1-kasa-p | MENZA | |

|_\3. Study rooms |
|_. Room |_. Building |_. Notes |
| STUD_VC53 | SED+VEL | on Veleslavínova street |
| STUD_KL20, STUD_KL87 | KLAT | on Klatovská street |
| STUD_PRA1 | SADY | |
| STUD_UB113, STUD_UB211 | LIB | in the on-campus library |
| STUD_ST407 | SED+VEL | |
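
Assigning JIS places to buildings could be sketched as below. The dictionary is only an excerpt of the tables above, the function names are hypothetical, and it is an assumption that the "Menza4-kasa{x}" tags appear in the data as "Menza4-kasa1" to "Menza4-kasa5". Rows are represented as dictionaries keyed by the field names listed earlier.

```python
import re

# Exact tag -> building abbreviation; an excerpt of the tables above.
PLACE_TO_BUILDING = {
    "A1": "KARMA",
    "L-Posilovna": "KLOCH",
    "Zavora-FEL": "FEL",
    "NTIS-BUFET": "FAV",
    "UV1-Bufet": "FST+FEK",
    "MenzaKL-vydej": "MENZA",
    "STUD_UB113": "LIB",
}

# Assumed concrete form of "Menza4-kasa{x}" with x in <1, 5>.
_MENZA4_KASA = re.compile(r"Menza4-kasa[1-5]")


def building_for_place(tag):
    """Return the building for a popis_objektu tag, or None if unassigned."""
    tag = tag.strip()
    if tag in PLACE_TO_BUILDING:
        return PLACE_TO_BUILDING[tag]
    if _MENZA4_KASA.fullmatch(tag):
        return "MENZA"
    return None


def logs_per_building(rows):
    """Sum pocet_logu per building, skipping places without a building."""
    totals = {}
    for row in rows:
        building = building_for_place(row["popis_objektu"])
        if building is None:
            continue
        totals[building] = totals.get(building, 0) + int(row["pocet_logu"])
    return totals
```

Places that have no single building in the tables, such as "Zavora-Kaplirova" or the campus-wide "Parkoviste-vjezd", fall through to @None@ here and would need a separate rule.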

h3. WebAuth data

Link to data: http://opendata.zcu.cz/Autentizacni-system.html

Data contains:

* datum - date of access
* budova - building tag
* hodina_zacatek - start of lecture
* hodina_konec - end of lecture
* pocet_prihlaseni - number of successful sign-ins to the given computer during the given lecture
* stroj_hostname - name of the specific computer
* typ_objektu - type of object (classroom, laboratory, lecture room, other)
* ucebna_nazev - specific name of the room
* vyucovaci_hodina - lecture number (according to the timetable)

The linked page states that "... Signing in using orion login and password can also help track sign-ins to computers at ZČU and corresponding activity in computer laboratories ...". However, it seems questionable whether all computer logins are really included in this data. The data from 2019 contains only 106 different rooms for all of ZČU, which seems suspicious, especially since some rooms that we know are equipped with computers and are used (at least sometimes) are missing.

It would again be possible to assign those rooms to the appropriate buildings using the table at the beginning of the ZČU open data chapter, assuming that a similar set of students attends lessons in the same building (which is often the case, at least with KIV lectures).
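
As an illustration of this idea, sign-ins could be summed per building, date and lecture number. The sketch below is an assumption about how such an aggregation might look: it uses only a small, two-letter excerpt of the prefix table, the function name is hypothetical, and rows are dictionaries keyed by the field names listed above.

```python
from collections import defaultdict

# Two-letter excerpt of the room prefix table (subset chosen for brevity).
PREFIX_TO_BUILDING = {
    "UC": "FAV",
    "UN": "FAV",
    "US": "FAV",
    "EU": "FEL",
    "UB": "LIB",
}


def sign_ins_per_building(rows):
    """Sum pocet_prihlaseni per (building, datum, vyucovaci_hodina)."""
    totals = defaultdict(int)
    for row in rows:
        prefix = row["ucebna_nazev"].strip().upper()[:2]
        building = PREFIX_TO_BUILDING.get(prefix)
        if building is None:
            continue  # room not covered by the excerpt
        key = (building, row["datum"], row["vyucovaci_hodina"])
        totals[key] += int(row["pocet_prihlaseni"])
    return dict(totals)
```

Summing over all computers in a building hides which room was used, which matches the assumption that students tend to stay within one building.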

More detailed data analysis is in [[Login activity at ZCU - graphs]].

h3. Occupancy data

Link to data: http://opendata.zcu.cz/Obsazeni-mistnosti.html

Data contains:

* rok_platnosti - year
* budova - building tag
* ucebna_nazev - room name
* typ_objektu - type of room (učebna/laboratoř/posluchárna/jiné, i.e. classroom/laboratory/lecture hall/other)
* kapacita_objektu - maximum capacity of the room
* obsazeni - number of enrolled students
* predmet - abbreviation of the timetable action
* typ_akce - type of lesson (seminář/přednáška/cvičení, i.e. seminar/lecture/practical)
* vyucovaci_hodina - lesson number (according to the timetable)
* hodina_zacatek - lesson start
* hodina_konec - lesson end
* semestr - semester (Letní semestr/Zimní semestr, i.e. summer/winter semester)
* tyden - week (S (even), L (odd), K (every), J (other))
* tyden_v_roce - week of the year
* datum - date

It seems possible that not all lessons taught at ZČU are included in this data. The data from 2019 and 2020 contains only 1202 unique lesson instances.
Some instances also have no assigned building and room name; however, this should not be an issue, since lessons are usually looked up by their abbreviation, not by room.
How to work with lessons that are not included in these datasets is a topic for either [[Prediction models]] or the handling of user input.
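
Looking lessons up by abbreviation, regardless of whether a room is assigned, could be sketched like this. The function names and the sample abbreviation used in any call are hypothetical; rows are dictionaries keyed by the field names listed above.

```python
from collections import defaultdict


def index_by_subject(rows):
    """Group occupancy rows by their predmet abbreviation."""
    index = defaultdict(list)
    for row in rows:
        index[row["predmet"]].append(row)
    return dict(index)


def has_room(row):
    """Return True if both budova and ucebna_nazev are filled in."""
    building = (row.get("budova") or "").strip()
    room = (row.get("ucebna_nazev") or "").strip()
    return bool(building and room)
```

Rows for which @has_room@ is false stay in the index, so a lookup by abbreviation still finds them; only a later assignment to a building has to skip them.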

h2. Weather data

Link to data: http://wttr.in/Plzen,czechia?format=j1

The data is in JSON format and contains a detailed weather forecast for Pilsen, CZ. The following details are the most useful for this application:

Current weather:

* localObsDateTime - date and time
* cloudcover - cloud cover (values in range <0, 100>)
* temp_C - temperature (°C)
* windspeedKmph - wind speed (km/h)

The forecast contains hourly predictions of the following values:

* tempC - temperature (°C)
* WindGustKmph - wind gust speed (km/h)
* chanceofrain - chance of rain (0-100 %)
* cloudcover - cloud cover (values in range <0, 100>)

Cloud cover can be used to estimate conditions such as sunny, cloudy and overcast:

* 0-33 - sunny
* 33-66 - cloudy
* 66-100 - overcast
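
Reading the current weather and classifying cloud cover might look like the sketch below. It works on an already parsed JSON document and assumes that the current values sit in the first element of a "current_condition" list and are encoded as strings; the function names and output keys are hypothetical. The ranges above overlap at 33 and 66, so assigning those values to the lower category is also an assumption.

```python
def classify_cloudcover(percent):
    """Map cloud cover in percent to sunny / cloudy / overcast.

    The boundary values 33 and 66 are assigned to the lower category;
    this is an assumption, the ranges above overlap at those points.
    """
    if not 0 <= percent <= 100:
        raise ValueError("cloud cover must be within 0-100")
    if percent <= 33:
        return "sunny"
    if percent <= 66:
        return "cloudy"
    return "overcast"


def current_conditions(payload):
    """Extract the fields listed above from a parsed wttr.in document."""
    # Assumed layout: {"current_condition": [{...string values...}], ...}
    current = payload["current_condition"][0]
    cloudcover = int(current["cloudcover"])
    return {
        "observed_at": current["localObsDateTime"],
        "temperature_c": int(current["temp_C"]),
        "wind_kmph": int(current["windspeedKmph"]),
        "cloudcover": cloudcover,
        "sky": classify_cloudcover(cloudcover),
    }
```

The payload would typically come from @json.loads@ applied to the response body of the link above.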

The current weather does not include a rain probability, so it has to be taken from the forecast for today.
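
One way to do this is to pick the forecast slot that covers the current time. The sketch below assumes that each day in the "weather" list has a "date" in ISO format and an "hourly" list whose "time" values are strings in hhmm notation without leading zeros (for example "0", "300", "1500"); the function name is hypothetical.

```python
from datetime import datetime


def current_chance_of_rain(payload, now):
    """Return chanceofrain (percent) of the latest slot starting at or before now.

    Returns None when the payload has no forecast for today's date.
    """
    today = now.date().isoformat()
    day = next((d for d in payload["weather"] if d.get("date") == today), None)
    if day is None:
        return None
    now_hhmm = now.hour * 100 + now.minute
    slot = None
    # Walk the slots in chronological order and keep the last one already started.
    for hour in sorted(day["hourly"], key=lambda h: int(h["time"])):
        if int(hour["time"]) <= now_hhmm:
            slot = hour
    if slot is None:
        return None
    return int(slot["chanceofrain"])
```

Passing the current time in as an argument keeps the function easy to test with a fixed timestamp.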