Papers
Topics
Authors
Recent
Search
2000 character limit reached

Total Error Sheets for Datasets (TES-D) -- A Critical Guide to Documenting Online Platform Datasets

Published 25 Jun 2023 in cs.CY and cs.DB | (2306.14219v1)

Abstract: This paper proposes a template for documenting datasets that have been collected from online platforms for research purposes. The template should help to critically reflect on data quality and increase transparency in research fields that make use of online platform data. The paper describes our motivation, outlines the procedure for developing a specific documentation template that we refer to as TES-D (Total Error Sheets for Datasets) and has the current version of the template, guiding questions and a manual attached as supplementary material. The TES-D approach builds upon prior work in designing error frameworks for data from online platforms, namely the Total Error Framework for digital traces of human behavior on online platforms (TED-On, https://doi.org/10.1093/poq/nfab018).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (7)
  1. Data statements for natural language processing: Toward mitigating system bias and enabling better science. Transactions of the Association for Computational Linguistics, 6:587--604.
  2. Datasheets for datasets. Communications of the ACM, 64(12):86--92.
  3. Survey Methodology. John Wiley & Sons.
  4. Total survey error: Past, present, and future. Public Opinion Quarterly, 74(5):849--879.
  5. Social data: Biases, methodological pitfalls, and ethical boundaries. Frontiers in Big Data, 2:13.
  6. “call me sexist, but...”: Revisiting sexism detection using psychological scales and adversarial samples. In Proceedings of the International AAAI Conference on Web and Social Media, volume 15, pages 573--584.
  7. A total error framework for digital traces of human behavior on online platforms. Public Opinion Quarterly, 85(S1):399--422.
Citations (1)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.