Understanding Data#
Beginning a data-focused project can be daunting! This section will help you:
Interpret why data was collected and examine its perceived limitations
Understand the basics of data licensing and access
Identify issues that may aid or hinder reproducibility
Know where to get started with finding data
Data Analysis#
A typical Data Analysis approach takes the following path:
Here, we invite you to consider the addition of three components, Data Biography, Licensing, and Archiving and Publishing, each marked in the revised approach below:
Data biography#
Who collected the data?
How did they collect it?
For what purpose?
How is it used? By whom?
What are its impacts? On whom?
What are the known limitations?
Licensing#
How can you use data?
How do you know?
At a minimum, generate a citation for the data that you will use.
Find a license. If there is a license, what does it mean?
Consult Creative Commons to learn about licenses and their implications.
Regardless of the license, always include a citation.
Archiving and Publishing#
For more information about archiving and publishing options, consult the section on Archival Repositories.
[2] Data Biography and Data Analysis Images based on: Catherine D’Ignazio. Creative data literacy: bridging the gap between the data-haves and data-have nots. Information Design Journal, 23(1):6–18, 2017.