Tools

From CSV on the Web Working Group Wiki
Revision as of 12:16, 5 February 2014 by Dceolin (Talk | contribs)

Jump to: navigation, search

CSVKit

A suite of utilities for converting to and working with CSV, the king of tabular file formats

https://github.com/onyxfish/csvkit

CSVLint

WIP?

http://csvlint.io/ - https://github.com/theodi/csvlint

CSV Schema Language and CSV Validator

The National Archives (UK)

https://github.com/digital-preservation/csv-validator/

XSLT for converting CSV to XML: https://github.com/digital-preservation/csv-tools/

Datapipes

Data Pipes is a service to provide streaming, "pipe-like" data transformations on the web – things like deleting rows or columns, find and replace, head, grep etc.

http://datapipes.okfnlabs.org

Python's CSV module

http://docs.python.org/release/2.7/library/csv.html

It is part of the official Python distribution since version 2.3. Provides a simple shell around CSV files to read or write line-by-line, possibly use the header row's elements as keys. It is customizable to handle different separation characters (comman, tab, etc.). It uses Python's iteration model around the standard file object, meaning that it is usable with possibly large files.

Tablinker

Tablinker is an experimental software for converting manually annotated Microsoft Excel workbooks to the RDF Data Cube vocabulary.

https://github.com/Data2Semantics/TabLinker

PLSheet

PLSheet is a SWI-Prolog library for analyzing ODF spreadsheets.

https://github.com/Data2Semantics/PLSheet

Harmonize

Harmonize is a tool (prototype) for normalizing and aligning RDF Data Cube datasets, and getting CSVs out of it.

http://lod.cedar-project.nl:8082/harmonize

https://github.com/CEDAR-project/Harmonize