Sarah Hatoum
Sarah Hatoum
  • 3
  • 18 802
Introduction to OpenRefine - Skillshare
This video was created for a Digital Humanities course at Pratt Institute's School of Information and is an introduction to OpenRefine, an open source data cleaning and manipulation software. I will introduce you to the software, show you how to allocate more memory, and add a statistics extension. The bulk of the video has to do with using basic features of OpenRefine; I will guide you through the features using a public humanities dataset from NYPL's What's on the Menu? crowdsourcing project.
Please feel free to jump around to different sections of the video based on your interests.
How to download: 0:54-1:44
Dataset: 1:45-5:35
Allocating more memory: 5:36-7:47
Using basic features: 7:28-26:16
Adding stats extension: 26:17-27:22
Ultimately, I hope this video acts as a springboard for learning more about OpenRefine and pursuing DH projects.
มุมมอง: 18 740

วีดีโอ

US Gov
มุมมอง 259 ปีที่แล้ว
US Gov
LibGuide - U.N. Database
มุมมอง 389 ปีที่แล้ว
LibGuide - U.N. Database

ความคิดเห็น

  • @lhodeniz
    @lhodeniz 4 หลายเดือนก่อน

    I wonder you're cutting dataset in MS Excel while openrefine can do it well.

  • @michaelw.881
    @michaelw.881 4 ปีที่แล้ว

    Thanks dear for this nice introduction.

  • @richardschavez9041
    @richardschavez9041 6 ปีที่แล้ว

    Hi! A lot of thanks to this video. Very helpful. Learned a lot even to this single video. 😇👌 🙇

  • @xipozz
    @xipozz 6 ปีที่แล้ว

    Awesome! Thank you so much!

  • @NavidNobani
    @NavidNobani 7 ปีที่แล้ว

    Opening the file ( the same dish.csv file) with OpenRefine, the default column separator is on "customer" which shows the data like a csv and not in a table and Chenging it to "commas(CSV)" messes up all the column completely. What I'm doing wrong here?

    • @hilgeflupp896
      @hilgeflupp896 3 ปีที่แล้ว

      It's probably a bit late, but maybe helpful for others: Don't open the CSV file in Excel. In some countries Excel uses Commas for decimal numbers (instead of a dot like in the US) and doesn't understand the meaning of the commas in a CSV-file. Opening the file in Excel will corrupt it, and that's why the file shows up completely wrong in OpenRefine. Use an Editor instead to open and editn the file.

  • @tomasmantilla
    @tomasmantilla 7 ปีที่แล้ว

    Much better than reading instructive blog posts, thanks!

  • @datadiscovery180
    @datadiscovery180 8 ปีที่แล้ว

    Thanks Sarah!!

  • @darkeuphoria01
    @darkeuphoria01 8 ปีที่แล้ว

    Thank you so much for this.