University of Sussex
Browse
Open Refine Guide.pdf (614.8 kB)

Beyond Excel: how to start cleaning data with OpenRefine

Download (614.8 kB)
journal contribution
posted on 2023-06-09, 02:31 authored by Antony GrovesAntony Groves
Within our different roles as information professionals, we are all expected to handle larger and larger amounts of data, from the resources we manage to the analytics we collect. However as this data gets bigger it can become harder to analyse. Ham explains that this is often due to errors and inconsistencies in the collection and management of data (2013, p.233), not to mention the time involved in learning how to analyse all of this information, along with the analysis itself. The following guide hopes to address some of these issues by introducing readers to OpenRefine (formerly Google Refine), an open source piece of software that can help to remove some of the errors and inconsistencies in datasets, in a timely manner, without expert knowledge being required.

History

Publication status

  • Published

File Version

  • Accepted version

Journal

Multimedia Information and Technology

ISSN

1466-190X

Publisher

Chartered Institute of Library and Information Professionals: Multimedia Information and Technology Group

Issue

2

Volume

42

Page range

18-22

Full text available

  • Yes

Peer reviewed?

  • No

Legacy Posted Date

2016-08-12

First Open Access (FOA) Date

2016-08-12

First Compliant Deposit (FCD) Date

2016-08-11

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC