Marcin Chojnacki − His main involvement in the project has been the acquisition of data from external database systems. Due to the nature and scope of Shared Cities, the data came primarily from Facebook (open API) and newsletters about various Katowice events, such as e.g. silesiaspace.pl. This includes both posts, comments, posts from the pages suggested by the other members of the workgroup. What lies in the pipeline for him as part of the project is changing of the format of the available data, so that the information from Facebook can be combined with the data describing the cultural circuits of other services, e.g. Katowice’s local Ultramaryna.
Dawid Górny − His responsibilities include, among others, the selection of tools and environments that the entire workgroup uses for communication and data work. The key criterion in selecting software is its capacity for exchanging and combining different types of data. Along with Marcin, he prepares scripts to download data from e.g. social media. He is also involved in developing new methods of obtaining data from other sources. In the latter part of the project, he is going to create tools and interfaces for data analysis and online presentation.
- Łukasz Mirocha: What steps do you need to take to transform a collection of raw data into knowledge that is intelligible to a wide audience?
Marcin: Raw data requires appropriate processing. External services do not always allow direct access to the data we are interested in. We need to systematise it first, then save it locally on our server to keep its current state and have continuous access to it. This is a necessary prerequisite for the ‘proper’ research to begin.
Dawid: Also, you mustn’t forget about the essentials – before you download any data, you need to prepare the server, the repository, the database, and any scripts you may need, as these are the key tools without which the research work using the acquired data would simply be impossible.