Skip to content

Latest commit

 

History

History
183 lines (153 loc) · 8.08 KB

capture.en.adoc

File metadata and controls

183 lines (153 loc) · 8.08 KB

Data capture

Note
In this module, you will learn about the concept of standards, in particular, the Darwin Core Standard and its components. You will also learn the types of primary biodiversity data and how to best share that information within GBIF. Lastly, you will review principles of data quality in the context of data capture and will learn about data quality and coherence (especially on subjects such as georeferencing, dates, names and taxa cross-checking).

Standards and Darwin Core

Note
In this video (15:37), you will learn how you interact with standards every day. Then you will be introduced to Biodiversity Information Standards, including the Darwin Core Standard with which you will continue to use throughout this course. If you are unable to watch the embedded video, you can download it locally. (MP4 - 27 MB)

Data origins and types

Note
In this video (10:45), you will review primary biodiversity data that can be shared within GBIF. If you are unable to watch the embedded video, you can download it locally. (MP4 - 19 MB)

Questions

  • Is your data type different than you originally thought?

  • With what kind of data do you work?

  • How would you publish your data to GBIF (using which Core and/or extension)?

Data capture, processing and quality

Note
In this video (09:11), you will explore the principles of data quality applied to data capture, specifically when capturing data from collection labels, fieldwork notebooks, spreadsheets, etc. If you are unable to watch the embedded video, you can download it locally. (MP4 - 19 MB)

Exercise 2

Note
For this activity, you will complete an exercise simulating data capture. You will begin to work with Darwin Core terms and make decisions on data that is needed for your organization/project and you will consider which of that data will be shared later during publication.

Read USE CASE I (if you haven’t already).

Imagine that you are the person assigned to transcribe the data found on the herbarium sheets.

  1. Download UC1-2-base-material.zip. (34.4 MB). There are 10 images. Two images per specimen for a total of five specimens. The herbarium sheets are in Spanish (data may come to you in various means and in other languages than your own), but you should be able to recognize the data contained in the fields on the labels. Remember to use both images per record to compile the information.

  2. Download the spreadsheet template: UC1-2-occurrence-template.xlsx (57.3 KB) to transcribe the information found on each of the images for the five specimens.

  3. Use the previously downloaded exercise sheet to provide your answers.

Note
you may need to add fields to the spreadsheet as you may be able to capture more information from the labels than was planned for in the template.

Review

Note
Quiz yourself on the concepts learned in this section.
  1. What dataset type(s) would you choose for an ichthyology collection?

    :figure-caption!:
    .https://www.gbif.org/occurrence/441867173[Eutrigla gurnardus (Linnaeus, 1758)] | Muséum d’histoire naturelle de Nice
    image::img/web/QDataTypes-specimen.png[align=center,width=640,height=360]
    
    - [x] occurrence
    - [x] checklist
    - [ ] sampling event
  2. What dataset type(s) would you choose for a list of invasive species?

    :figure-caption!:
    .Water hyacinth (Eichhornia crassipes) observed in Bourail, New Caledonia, where it is an introduced and invasive species by GRIIS. Photo by gérard (2016) licensed under CC BY-SA 2.0
    image::img/web/QDataTypes-plant.png[align=center,width=640,height=360]
    
    - [x] occurrence
    - [x] checklist
    - [ ] sampling event
  3. What dataset type(s) would you choose for the flora and fauna of an environmental impact study?

    Environmental impact assessment studies are done by experts in order to assess the biodiversity and biotopes of a given area, before, during and after it is affected by human activities (road works, wind turbines, mining, building construction, etc.).
    
    :figure-caption!:
    .https://upload.wikimedia.org/wikipedia/commons/6/64/Entomologist_chasing_buterflies.jpg[Entomologist chasing butterflies] by Matthieu Gauvain (CC-BY-SA)
    image::img/web/QDataTypes-field.png[align=center,width=640,height=360]
    
    - [x] occurrence
    - [ ] checklist
    - [x] sampling event
  4. What dataset type(s) would you choose for bird tracking data?

    Bird-tracking data are recorded using specific devices, such as GPS trackers mounted on live birds, thus allowing scientists to track their migratory routes or breeding sites.
    
    :figure-caption!:
    .https://commons.wikimedia.org/wiki/File:Griffon_vulture_at_Gamla_nature_reserve.jpg[Griffin vulture] observed at Gamla Nature Reserve by מינוזיג - MinoZig (CC0)
    image::img/web/QDataTypes-tracking.png[align=center,width=640,height=360]
    
    - [x] occurrence
    - [ ] checklist
    - [ ] sampling event
  5. What dataset type(s) would you choose for insect trap data?

    :figure-caption!:
    .https://www.flickr.com/photos/miheco/215305679/[Insect trap] by miheco (CC-BY-SA)
    image::img/web/QDataTypes-traps.png[align="center",width=640,height=360]
    
    - [x] occurrence
    - [ ] checklist
    - [x] sampling event
  6. What dataset type(s) would you choose for national park management data?

    Data acquired in the context of protected areas management (such as national parks but also smaller nature reserves) can be diverse and have different origins: botanical surveys, tagged animals tracking, observations from rangers and guards, and even ‘citizen science’ data or data inferred from pictures shared on social medias.
    
    :figure-caption!:
    .https://pixabay.com/photos/%C3%A9l%C3%A9phant-%C3%A9l%C3%A9phant-d-asie-4037451/[Sri Lankan elephants] observed by pen_ash.
    image::img/web/QDataTypes-Observations.png[align=center,width=640,height=360]
    
    - [x] occurrence
    - [x] checklist
    - [x] sampling event
  7. What dataset type(s) would you choose for a citizen science bioblitz?

    Citizen science data are often collected through thematic fieldwork days known as a “bioblitz.” Volunteers typically gather in a given area and spend the day trying to observe and identify as many species as they can in this area.
    
    Data from each participant are captured and merged into the citizen science programme’s data capture or data management tool.
    
    :figure-caption!:
    .https://www.nps.gov/experiences/citizen-scientists-bioblitz.htm?fullweb=1[Looking for birds with park staff] by US National Park Service (authorized reuse on google image search)
    image::img/web/QDataTypes-citizen.png[align=center,width=640,height=360]
    
    - [x] occurrence
    - [ ] checklist
    - [x] sampling event
  8. What dataset type(s) would you choose for a regional species list?

    :figure-caption!:
    .https://pixabay.com/photos/zoo-de-magdebourg-makibo-2876837/[Black rhino] observed at the Magdeburg Zoo in Germany by Mani300
    image::img/web/QDataTypes-threatened.png[align=center,width=640,height=360]
    
    - [ ] occurrence
    - [x] checklist
    - [ ] sampling event