Alex Konig, 2021-05-01 16:01
1 | 1 | Alex Konig | h1. Data sources |
---|---|---|---|
2 | 2 | Alex Konig | |
3 | 12 | Alex Konig | h2. ZČU open data |
4 | 1 | Alex Konig | |
5 | 35 | Alex Konig | More detailed data analysis can be found on these pages: |
6 | * [[Weather at ZCU]] |
||
7 | * [[Jis activity - graphs]] |
||
8 | * [[Login activity at ZCU - graphs]] |
||
9 | |||
10 | The data file structure is described on the page [[Data file structure]]. |
||
11 | |||
12 | 4 | Alex Konig | All ZČU data can be downloaded in XML, CSV and JSON formats. |
13 | |||
14 | 22 | Alex Konig | As discussed in later chapters, some data sources do not provide data with sufficient granularity or in sufficient amounts. The sources may offer more suitable datasets in the future, and this possibility should at least be acknowledged; it is discussed further in [[Prediction models]]. Standard university tags are used throughout the data, but for some of them (for example "parkoviště" or "STUD-PRA1") no source explains what they refer to, so we had to guess where these places are. |
15 | 4 | Alex Konig | |
16 | 22 | Alex Konig | To be able to display correct predictions, we need to process the data so that it is divided by the building it belongs to. Those buildings are: |
17 | 1 | Alex Konig | |
18 | 22 | Alex Konig | Buildings on campus: |
19 | * Fakulta strojní + ekonomická |
||
20 | * Fakulta designu a umění |
||
21 | * Fakulta aplikovaných věd |
||
22 | * Fakulta elektrotechnická |
||
23 | * Rektorát ZČU |
||
24 | * Menza |
||
25 | * Library |
||
26 | * CIV, ZV, UCV, IPC |
||
27 | * Univerzitní 14 |
||
28 | 1 | Alex Konig | |
29 | 22 | Alex Konig | Dorms: |
30 | * Koleje armabeton |
||
31 | * Koleje Bory |
||
32 | * Koleje Lochotín |
||
33 | * Koleje klatovská |
||
34 | 1 | Alex Konig | |
35 | 22 | Alex Konig | Buildings in the city: |
36 | * Dominikánská 9 |
||
37 | * Husova 11 |
||
38 | * Chodské náměstí 1 |
||
39 | * Jungmannova 1, 3 |
||
40 | * Klatovská 51 |
||
41 | * Kollárova 19 |
||
42 | * Riegrova 11, 17 |
||
43 | * Sady Pětatřicátníků 14, 16 |
||
44 | * Sedláčkova 15, 19, 31, 38-40, Veleslavínova 27-29, 42 |
||
45 | * TESLOVA 5, 9, 9a, 11 - objekty C, F, G, H v areálu VTP Plzeň |
||
46 | * Tylova 59 |
||
47 | 1 | Alex Konig | |
48 | 26 | Alex Konig | Data from Cheb are discarded because the buildings there are far from the others, which would cause problems in visualization, and because the data may contain too little relevant information. |
49 | |||
50 | 22 | Alex Konig | Classroom prefixes can be divided in the following way: |
51 | 1 | Alex Konig | |
52 | 22 | Alex Konig | |_. Building |_. Abbreviation |_. Room prefixes | |
53 | 33 | Alex Konig | | Fakulta strojní + ekonomická | FST+FEK | UV, UU, UK, UL, UP, UF, UH, UD, UX | |
54 | 22 | Alex Konig | | Fakulta designu a umění | FDU | LS | |
55 | | Fakulta aplikovaných věd | FAV | UN, UC, US | |
||
56 | | Fakulta elektrotechnická | FEL | EU, EK, EL, EP, ES, ET, EH, EZ | |
||
57 | | Rektorát ZČU | REK | UR | |
||
58 | | Menza | MENZA | - | |
||
59 | | Library | LIB | UB | |
||
60 | | CIV, ZV, UCV, IPC | CIV | UI | |
||
61 | 29 | Alex Konig | | Univerzitní 14 | UNI14 | UT | |
62 | 22 | Alex Konig | | | | | |
63 | 29 | Alex Konig | | Dominikánská 9 | DOM | DD | |
64 | 30 | Alex Konig | | Husova 11 | HUS | HJ | |
65 | 29 | Alex Konig | | Chodské náměstí 1 | CHOD | CH | |
66 | | Jungmannova 1, 3 | JUNG | JJ | |
||
67 | | Klatovská 51 | KLAT | KL | |
||
68 | | Kollárova 19 | KOLL | KO | |
||
69 | | Riegrova 11, 17 | RIEG | RJ, RS | |
||
70 | | Sady Pětatřicátníků 14, 16 | SADY | PC, PS | |
||
71 | 32 | Alex Konig | | Sedláčkova 15, 19, 31, 38-40, Veleslavínova 27-29, 42 | SED+VEL | SP, SD, ST, SO, VC | |
72 | 29 | Alex Konig | | TESLOVA 5, 9, 9a, 11 - objekty C, F, G, H v areálu VTP Plzeň | TES | T, TF, TG, TH | |
73 | | Tylova 59 | TYL | TY, TS | |
||
74 | 23 | Alex Konig | | | | | |
75 | 29 | Alex Konig | | Koleje armabeton | KARMA | - | |
76 | | Koleje Bory | KBORY | - | |
||
77 | | Koleje Lochotín | KLOCH | - | |
||
78 | | Koleje klatovská | KKLAT | - | |
||
79 | 1 | Alex Konig | |
80 | 22 | Alex Konig | Building and room abbreviations were taken from this source: https://ps.zcu.cz/strediska/budovy-plzen.html |
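The room-prefix table above can be turned into a lookup, sketched here in Python. This is only an illustration: the function name @building_for_room@ and the room-name format (prefix letters followed by a number, for example "UC-336") are assumptions, not something confirmed by the data description. Because the prefix "T" (TES) is also the first letter of "TY" and "TS" (TYL), the sketch tries the longest prefix first.

```python
# Room prefixes per building abbreviation, copied from the table above.
ROOM_PREFIXES = {
    "FST+FEK": ["UV", "UU", "UK", "UL", "UP", "UF", "UH", "UD", "UX"],
    "FDU": ["LS"],
    "FAV": ["UN", "UC", "US"],
    "FEL": ["EU", "EK", "EL", "EP", "ES", "ET", "EH", "EZ"],
    "REK": ["UR"],
    "LIB": ["UB"],
    "CIV": ["UI"],
    "UNI14": ["UT"],
    "DOM": ["DD"],
    "HUS": ["HJ"],
    "CHOD": ["CH"],
    "JUNG": ["JJ"],
    "KLAT": ["KL"],
    "KOLL": ["KO"],
    "RIEG": ["RJ", "RS"],
    "SADY": ["PC", "PS"],
    "SED+VEL": ["SP", "SD", "ST", "SO", "VC"],
    "TES": ["T", "TF", "TG", "TH"],
    "TYL": ["TY", "TS"],
}

# Inverted lookup: prefix -> building abbreviation.
PREFIX_TO_BUILDING = {
    prefix: building
    for building, prefixes in ROOM_PREFIXES.items()
    for prefix in prefixes
}


def building_for_room(room_name):
    """Return the building abbreviation for a room name, or None if unknown.

    Assumes (hypothetically) that a room name starts with its prefix letters.
    """
    letters = ""
    for ch in room_name.upper():
        if not ch.isalpha():
            break
        letters += ch
    # Try the longest candidate first so that "TY" wins over "T".
    for length in range(len(letters), 0, -1):
        building = PREFIX_TO_BUILDING.get(letters[:length])
        if building is not None:
            return building
    return None
```

Buildings without room prefixes (MENZA and the dorms) are intentionally left out of the lookup.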
81 | 1 | Alex Konig | |
82 | 21 | Alex Konig | h3. Data timescale |
83 | |||
84 | Collection of each dataset started on a different date, so the amount of available data differs between datasets. |
||
85 | |||
86 | Jis data started to be recorded on 8.4.2018 |
||
87 | |||
88 | Log-ins started to be recorded on 20.10.2011 |
||
89 | |||
90 | Weather data started to be recorded on 30.4.2019 |
||
91 | 20 | Alex Konig | |
92 | Since JIS and log-in data seem to follow the same trends in every recorded year, we decided to use only the period for which weather data is also available, that is from 30.4.2019 onward. |
||
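A minimal Python sketch of restricting all datasets to the common period. The start dates are the ones listed above; the record layout (dictionaries holding a @datetime@ value under a key such as @datum_a_cas@) and the function names are assumptions made for illustration.

```python
from datetime import date, datetime

# First recorded day of each dataset, as listed above.
DATASET_START = {
    "jis": date(2018, 4, 8),
    "login": date(2011, 10, 20),
    "weather": date(2019, 4, 30),
}

# The common period starts when the youngest dataset starts.
COMMON_START = max(DATASET_START.values())


def filter_common_period(records, time_key="datum_a_cas"):
    """Keep only records dated on or after COMMON_START.

    Each record is assumed to be a dict whose time_key holds a datetime.
    """
    return [r for r in records if r[time_key].date() >= COMMON_START]
```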
93 | 4 | Alex Konig | |
94 | |||
95 | 3 | Alex Konig | h3. Historical weather data |
96 | |||
97 | 1 | Alex Konig | Link to data: http://opendata.zcu.cz/Energeticky-dispecink.html |
98 | |||
99 | 4 | Alex Konig | Data contains: |
100 | |||
101 | * datum_a_cas - date and time, time at which the values were measured with hour accuracy |
||
102 | * teplota - average temperature in given time slot (°C) |
||
103 | 7 | Alex Konig | * vitr - average wind speed in given time slot (m/s) |
104 | 2 | Alex Konig | * dest - value signifying rain (1) and no rain (0) |
105 | 7 | Alex Konig | * svetelnost - average value of luminance (k lux) |
106 | 1 | Alex Konig | |
107 | 7 | Alex Konig | For further processing, luminance will be translated to the terms "sunny", "overcast" and "cloudy". Values in the 2019 data range from 0 to 83.2 k lux. |
108 | 1 | Alex Konig | |
109 | 8 | Alex Konig | Lux values can be understood using the following table: |
110 | 7 | Alex Konig | |
111 | |_. Conditions |_. Value (lux) | |
||
112 | | Sunlight | 107527 | |
||
113 | | Full Daylight | 10752 | |
||
114 | | Overcast Day | 1075 | |
||
115 | | Very Dark Day | 107 | |
||
116 | | Twilight | 10.8 | |
||
117 | | Deep Twilight | 1.08 | |
||
118 | | Full Moon | 0.108 | |
||
119 | 1 | Alex Konig | | Quarter Moon | 0.0108 | |
120 | 8 | Alex Konig | | Starlight | 0.0011 | |
121 | 7 | Alex Konig | | Overcast Night | 0.0001| |
122 | |||
123 | 1 | Alex Konig | Source: https://www.engineeringtoolbox.com/light-level-rooms-d_708.html |
124 | 7 | Alex Konig | |
125 | 8 | Alex Konig | However, comparing the values in the data with a weather archive suggests that the following table is more appropriate: |
126 | 7 | Alex Konig | |
127 | |_. Conditions |_. Value (k lux) | |
||
128 | | Direct sunlight | >60 | |
||
129 | 1 | Alex Konig | | Sunny | 40-60 | |
130 | | Overcast | 20-40 | |
||
131 | 24 | Alex Konig | | Cloudy | 0-20 | |
132 | 7 | Alex Konig | | Night | 0 | |
133 | 4 | Alex Konig | |
134 | 19 | Alex Konig | Used weather archive: https://www.in-pocasi.cz/archiv/archiv.php?historie=2019-12-01&region=9 |
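The adjusted k lux table could be applied to the @svetelnost@ column roughly as follows. This is a sketch: the function name is hypothetical, and since the table ranges share their end points, the handling of the boundaries (exactly 20 and 40 go to the brighter band, exactly 0 counts as night) is an assumption.

```python
def classify_luminance(k_lux):
    """Map a svetelnost value in k lux to a condition from the adjusted table.

    Boundary handling is an assumption: 0 is "night", 20 and 40 fall
    into the brighter band, and only values above 60 are direct sunlight.
    """
    if k_lux < 0:
        raise ValueError("luminance cannot be negative")
    if k_lux == 0:
        return "night"
    if k_lux < 20:
        return "cloudy"
    if k_lux < 40:
        return "overcast"
    if k_lux <= 60:
        return "sunny"
    return "direct sunlight"
```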
135 | 18 | Alex Konig | |
136 | 1 | Alex Konig | More detailed data analysis in [[Weather at ZCU]] |
137 | |||
138 | h3. JIS data |
||
139 | |||
140 | Link to data: http://opendata.zcu.cz/Snimace-JIS.html |
||
141 | |||
142 | Data contains: |
||
143 | |||
144 | * datum_a_cas - timestamp of JIS authentication (accuracy in milliseconds) |
||
145 | * pocet_logu - number of authenticated users at the given time |
||
146 | * popis_objektu - description of object according to standard ZČU tagging |
||
147 | |||
148 | The linked page states that " ... Data about dorms, entry to laboratories and other spaces with restricted access, information about the university canteen, checkouts in the university library, access to copy machines etc. can be interesting for students ...". However, not all of these places can be found in the data. The data from 2019 contains only 46 different places, and most of them are dorms, parking lots and buffets. |
||
149 | |||
150 | The number of logged places may increase in the future, but it is also possible that the data was affected by GDPR and more detailed data will no longer be provided to the public. |
||
151 | 10 | Alex Konig | |
152 | 24 | Alex Konig | A possible solution is to assign the provided places to buildings, as in the tables below. A more detailed analysis of the data is in [[Jis activity - graphs]]. |
153 | 12 | Alex Konig | |
154 | |_\3. Dorms and gyms | |
||
155 | 24 | Alex Konig | |_. Dorm |_. Building |_. Location | |
156 | 31 | Alex Konig | | A1, A2-Hlavni vchod, A3, A2 | KARMA | on Borská street | |
157 | | B3-LEVY, B3-LevyVytah, B3-PRAVY, B3-PravyVytah, B3 | KBORY | on Baarova street | |
||
158 | | M16, M14 | KBORY | on Máchova street | |
||
159 | | L1, L2, L1L2-vchod | KLOCH | on Bolevecká street | |
||
160 | | L-Posilovna | KLOCH | in Bolevecká dorm | |
||
161 | | KL-Posilovna, K1 | KKLAT | on Klatovská street | |
||
162 | 12 | Alex Konig | |
163 | 1 | Alex Konig | |
164 | 11 | Alex Konig | |_\3. Parking lots | |
165 | 25 | Alex Konig | |_. Place |_. Building |_. Notes | |
166 | 1 | Alex Konig | | Zavora-FEL | FEL | | |
167 | 31 | Alex Konig | | Zavora-Kaplirova | - | on Kaplířova street | |
168 | 1 | Alex Konig | | US 005 - závora vjezd, US 005 - mříž vjezd | FAV | | |
169 | 20 | Alex Konig | | Zavora-FDU | FDU | | |
170 | 24 | Alex Konig | | Parkoviste-vjezd, Parkoviste-vyjezd | all on campus | | |
171 | 12 | Alex Konig | | Zavora-NTIS-vjezd, Zavora-NTIS-vyjezd | FAV | | |
172 | 31 | Alex Konig | | VC-VJEZD, VC-VYJEZD | SED+VEL | on Veleslavínova street | |
173 | 27 | Alex Konig | | KolaBory-vnejsi, KolaBory-vnitrni | FST+FEK | | |
174 | 24 | Alex Konig | | EXT/kola | FST+FEK | | |
175 | 12 | Alex Konig | | EXT/kola-B | FAV | | |
176 | 31 | Alex Konig | | B3-kolarna | KBORY | on Baarova street | |
177 | 1 | Alex Konig | |
178 | 12 | Alex Konig | |_\3. Food courts | |
179 | 24 | Alex Konig | |_. Name |_. Building |_. Notes | |
180 | 12 | Alex Konig | | EP-BUFET | FEL | | |
181 | | NTIS-BUFET | FAV | | |
||
182 | 24 | Alex Konig | | UV1-Bufet | FST+FEK | | |
183 | | MenzaKL-vydej | MENZA | | |
||
184 | | Menza4-kasa{x}| MENZA | x in range <1, 5> | |
||
185 | | Menza1-kasa-l, Menza1-kasa-p | MENZA | | |
||
186 | 1 | Alex Konig | |
187 | 12 | Alex Konig | |_\3. Study rooms | |
188 | 24 | Alex Konig | |_. Room |_. Building |_. Notes | |
189 | 31 | Alex Konig | | STUD_VC53 | SED+VEL| on Veleslavínova street | |
190 | | STUD_KL20, STUD_KL87 | KLAT | on Klatovská street | |
||
191 | | STUD_PRA1 | SADY | | |
||
192 | 24 | Alex Konig | | STUD_UB113, STUD_UB211 | LIB | in the on-campus library | |
193 | 31 | Alex Konig | | STUD_ST407 | SED+VEL | | |
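The assignment tables above can be expressed as a lookup and used to sum @pocet_logu@ per building. The Python sketch below makes several assumptions: that @popis_objektu@ values match the table entries exactly, that "Menza4-kasa{x}" expands to "Menza4-kasa1" through "Menza4-kasa5", and that the general campus parking lot gets a made-up marker @ALL_CAMPUS@. The function and constant names are hypothetical.

```python
from collections import Counter

# JIS places per building abbreviation, copied from the tables above.
PLACES_BY_BUILDING = {
    "KARMA": ["A1", "A2", "A2-Hlavni vchod", "A3"],
    "KBORY": ["B3", "B3-LEVY", "B3-LevyVytah", "B3-PRAVY", "B3-PravyVytah",
              "M14", "M16", "B3-kolarna"],
    "KLOCH": ["L1", "L2", "L1L2-vchod", "L-Posilovna"],
    "KKLAT": ["K1", "KL-Posilovna"],
    "FEL": ["Zavora-FEL", "EP-BUFET"],
    "FAV": ["US 005 - závora vjezd", "US 005 - mříž vjezd",
            "Zavora-NTIS-vjezd", "Zavora-NTIS-vyjezd",
            "EXT/kola-B", "NTIS-BUFET"],
    "FDU": ["Zavora-FDU"],
    "FST+FEK": ["KolaBory-vnejsi", "KolaBory-vnitrni", "EXT/kola", "UV1-Bufet"],
    "SED+VEL": ["VC-VJEZD", "VC-VYJEZD", "STUD_VC53", "STUD_ST407"],
    "KLAT": ["STUD_KL20", "STUD_KL87"],
    "SADY": ["STUD_PRA1"],
    "LIB": ["STUD_UB113", "STUD_UB211"],
    "MENZA": ["MenzaKL-vydej", "Menza1-kasa-l", "Menza1-kasa-p"]
             + [f"Menza4-kasa{x}" for x in range(1, 6)],
}

# Hypothetical marker for the parking lot that serves the whole campus.
ALL_CAMPUS = "ALL_CAMPUS"

PLACE_TO_BUILDING = {
    place: building
    for building, places in PLACES_BY_BUILDING.items()
    for place in places
}
PLACE_TO_BUILDING["Parkoviste-vjezd"] = ALL_CAMPUS
PLACE_TO_BUILDING["Parkoviste-vyjezd"] = ALL_CAMPUS
# Zavora-Kaplirova has no building in the table, so it maps to None.
PLACE_TO_BUILDING["Zavora-Kaplirova"] = None


def logs_per_building(rows):
    """Sum pocet_logu per building; rows with no known building are skipped."""
    totals = Counter()
    for row in rows:
        building = PLACE_TO_BUILDING.get(row["popis_objektu"])
        if building is None:
            continue
        totals[building] += int(row["pocet_logu"])
    return dict(totals)
```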
194 | 4 | Alex Konig | |
195 | |||
196 | h3. WebAuth data |
||
197 | |||
198 | Link to data: http://opendata.zcu.cz/Autentizacni-system.html |
||
199 | 3 | Alex Konig | |
200 | 4 | Alex Konig | Data contains: |
201 | 1 | Alex Konig | |
202 | 4 | Alex Konig | * datum - date of access |
203 | 1 | Alex Konig | * budova - building tag |
204 | 4 | Alex Konig | * hodina_zacatek - start of lecture |
205 | * hodina_konec - end of lecture |
||
206 | * pocet_prihlaseni - number of successful sign-ins to the given computer during the given lecture |
||
207 | 1 | Alex Konig | * stroj_hostname - name of specific computer |
208 | * typ_objektu - type of object (classroom, laboratory, lecture room, other) |
||
209 | * ucebna_nazev - specific name of room |
||
210 | 13 | Alex Konig | * vyucovaci_hodina - number of lecture (according to the timetable) |
211 | 4 | Alex Konig | |
212 | 1 | Alex Konig | The linked page states that "... Signing in using orion login and password can also help track sign-ins to computers at ZČU and corresponding activity in computer laboratories ...". However, it seems questionable whether this data really covers all computer logins. The data from 2019 contains only 106 different rooms for all of ZČU, and some rooms that we know are equipped with computers and are used (at least sometimes) are missing. |
213 | 18 | Alex Konig | |
214 | It would again be possible to assign those rooms to the appropriate buildings using the table at the beginning of the ZČU open data chapter, on the assumption that a similar set of students attends lessons in the same building (which is often the case, at least for KIV lectures). |
||
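Aggregating sign-ins per building and lecture slot could look roughly like this Python sketch. The field names come from the list above; @building_of@ is a hypothetical callable that maps a room name to a building abbreviation (for example via the room-prefix table at the beginning of this chapter), and the choice of grouping key is an assumption.

```python
from collections import defaultdict


def logins_per_building(rows, building_of):
    """Sum pocet_prihlaseni per (datum, vyucovaci_hodina, building).

    building_of is a caller-supplied lookup from room name to building
    abbreviation; rows whose room cannot be assigned are skipped.
    """
    totals = defaultdict(int)
    for row in rows:
        building = building_of(row["ucebna_nazev"])
        if building is None:
            continue
        key = (row["datum"], row["vyucovaci_hodina"], building)
        totals[key] += int(row["pocet_prihlaseni"])
    return dict(totals)
```

Summing over all computers in a building hides per-room detail, which matches the assumption that the same group of students stays within one building.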
215 | 4 | Alex Konig | |
216 | More detailed data analysis in [[Login activity at ZCU - graphs]]. |
||
217 | |||
218 | 1 | Alex Konig | |
219 | 4 | Alex Konig | h3. Occupancy data |
220 | |||
221 | Link to data: http://opendata.zcu.cz/Obsazeni-mistnosti.html |
||
222 | |||
223 | 12 | Alex Konig | Data contains: |
224 | |||
225 | * rok_platnosti - year |
||
226 | * budova - building tag |
||
227 | * ucebna_nazev - room name |
||
228 | * typ_objektu - type of room (učebna/laboratoř/posluchárna/jiné, i.e. classroom/laboratory/lecture room/other) |
||
229 | * kapacita_objektu - maximum capacity of room |
||
230 | * obsazeni - number of students enrolled |
||
231 | * predmet - abbreviation of timetable action |
||
232 | * typ_akce - type of lecture (seminář/přednáška/cvičení, i.e. seminar/lecture/practical) |
||
233 | * vyucovaci_hodina - lesson number (according to the timetable) |
||
234 | * hodina_zacatek - lesson beginning |
||
235 | 3 | Alex Konig | * hodina_konec - lesson end |
236 | 1 | Alex Konig | * semestr - semester (Letní semestr/Zimní semestr, i.e. summer/winter semester) |
237 | * tyden - week (S (even), L (odd), K (every), J (other)) |
||
238 | * tyden_v_roce - week in the year |
||
239 | 15 | Alex Konig | * datum - date |
240 | |||
241 | It seems possible that not all lessons taught at ZČU are included in this data. The data from 2019 and 2020 contains only 1202 unique lesson instances. |
||
242 | 1 | Alex Konig | There are also some instances without an assigned building and room name; however, this shouldn't be an issue since lessons are usually looked up by their abbreviation, not by room. |
243 | 15 | Alex Konig | How to work with lessons that are not included in these datasets is rather a topic either for [[Prediction models]] or handling user input. |
244 | 1 | Alex Konig | |
245 | |||
246 | h2. Weather data |
||
247 | |||
248 | Link to data: http://wttr.in/Plzen,czechia?format=j1 |
||
249 | |||
250 | 15 | Alex Konig | The data is in JSON format and contains a detailed weather prediction for Pilsen, CZ. The following details are the most useful for this application: |
251 | 1 | Alex Konig | |
252 | 17 | Alex Konig | Current weather: |
253 | 15 | Alex Konig | * localObsDateTime - date and time |
254 | * cloudcover - amount of clouds (values in range <0-100>) |
||
255 | 17 | Alex Konig | * temp_C - temperature (°C) |
256 | 1 | Alex Konig | * windspeedKmph - wind (km/h) |
257 | 15 | Alex Konig | |
258 | 36 | Alex Konig | The prediction contains hourly values for the following information: |
259 | * tempC - temperature (°C) |
||
260 | 1 | Alex Konig | * WindGustKmph - wind (km/h) |
261 | 15 | Alex Konig | * chanceofrain - chance of rain (0-100%) |
262 | * cloudcover - amount of clouds (values in range <0-100>) |
||
263 | |||
264 | 36 | Alex Konig | The cloudcover value can be used to estimate conditions such as sunny, cloudy and overcast: |
265 | * 0-33 - sunny |
||
266 | * 33-66 - cloudy |
||
267 | * 66-100 - overcast |
||
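The cloudcover ranges above translate into a small Python helper. This is a sketch: the function name is hypothetical, and because the listed ranges share their end points, sending exactly 33 to "cloudy" and exactly 66 to "overcast" is an assumption. The value is converted with @int()@ so that both numbers and numeric strings are accepted.

```python
def classify_cloudcover(cloudcover):
    """Map a cloudcover percentage (0-100) to sunny / cloudy / overcast.

    Boundary handling is an assumption: 33 counts as cloudy, 66 as overcast.
    """
    value = int(cloudcover)
    if not 0 <= value <= 100:
        raise ValueError("cloudcover must be between 0 and 100")
    if value < 33:
        return "sunny"
    if value < 66:
        return "cloudy"
    return "overcast"
```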
268 | |||
269 | The current weather does not include a rain probability, so it has to be taken from today's prediction. |
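Looking up the rain probability for the current hour in today's prediction might be sketched as below. The JSON layout is an assumption about the @format=j1@ output and should be checked against a real response: a top-level @weather@ list whose first item is today, each with an @hourly@ list whose entries carry @time@ as a string such as "0", "300" or "2100" and @chanceofrain@ as a numeric string. The function name is hypothetical.

```python
def current_chance_of_rain(forecast, hour):
    """Return chanceofrain (0-100) of the latest slot starting at or before hour.

    forecast is the parsed JSON document; hour is the local hour (0-23).
    Assumes forecast["weather"][0] is today and that slot["time"] encodes
    the slot start as hour * 100 (for example "300" for 03:00).
    Returns None if no slot starts at or before the given hour.
    """
    today = forecast["weather"][0]
    best = None  # (slot start hour, chance of rain)
    for slot in today["hourly"]:
        slot_hour = int(slot["time"]) // 100
        if slot_hour <= hour and (best is None or slot_hour > best[0]):
            best = (slot_hour, int(slot["chanceofrain"]))
    return None if best is None else best[1]
```

The parsed document can be obtained with the standard @json@ module from the response of the URL given above.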