Data Package Pipelines

Framework for processing data packages in pipelines of modular components.

Repository

datapackage-py

A Python library for working with Data Packages.

Repository

datapackage-js

Official JavaScript library for Data Packages in Node and the browser.

Repository

goodtables-py

Validate and process tabular data in Python.

Repository

try.goodtables.io

A web service to validate and process tabular data.

Website Issue Tracker

tableschema-py

A Python library for working with Table Schema.

Repository

tableschema-js

A library for working with Table Schema in Javascript.

Repository

Metatab

Python language parser for a tabular format for structured metadata.

Website Repository

datapak

A library to work with tabular data packages.

Repository

Datapaka

An easy interface for documenting data packages.

Repository

DPMR

R Data Package Manager.

Repository

Open Food Hackdays Portal

A publicly available repository of nutrition data Data Packages, collated by School of Data Switzerland, for use in Open Food Data projects

Repository Data Portal Website

Data Curator

Desktop CSV editor to help describe, validate and share usable open data

Website Repository

Data Package Inspector

A tool inspired by Data Package Viewer and set up by BCO-DMO to visualize Frictionless Data Data Packages

Repository Website

datapackage-rb

A Ruby library for working with Data Packages.

Repository

Tesera

Tesera publishes a variety of Data Package-aware tools.

Website Case Study Repository

tableschema-php

A PHP library for working with Table Schema.

Repository

Import for Google Spreadsheets experimental

Import Tabular Data Packages into Google Spreadsheets.

Repository

tableschema-clj

A Clojure library for working with Table Schema.

Repository

json-table

A validator and storage library for working with Table Schema.

Repository

Comma Chameleon

A desktop CSV editor for data publishers.

Website Repository

Data.World

Data.world provides all datasets as Data Packages.

Website Case Study

datapackage-connector

Power BI Custom Connector for loading tables directly from Tabular Data Packages into Power BI through the 'Get Data' experience.

Repository

pandas-datapackage-reader

Data Package reader for Pandas.

Repository

tableschema-go

A Go library for working with Table Schema.

Repository

RODProt

R Open Data Protocols Library.

Repository

tableschema-r

An R library for working with Table Schema.

Repository

Data Package Viewer service

View Data Package metadata in human-readable form.

Website Repository

DataPackage.jl

A Julia library for working with Data Packages.

Repository

Stenci.la coming soon

The office suite for reproducible research

Website Repository

Central de Dados

A repository of Open Data, archived using the data package format, in Portugal.

Repository Website

datapackage-m

Power Query M functions for working with Tabular Data Packages in Power BI and Excel.

Repository

CSVDDF-Python

CSVDDF support for Python.

Repository

Octopub

Octopub provides a platform to publish CSV data on an automatically created webpage.

Website Repository

TableSchema.jl

A Julia library for working with Table Schema.

Repository

Data Central

A lightweight platform to easily publish and distribute datasets.

Repository

tableschema-java

A Java library for working with Table Schema.

Repository

ERD Table Schema

Create an ERD for a database given as Table Schema.

Repository

datapackage-php

A PHP library for working with Data Packages.

Repository

HarvestChoice

HarvestChoice publishes its bulk agricultural data as zipped Data Packages.

Website

Open Power System Data

Open Power System Data develops a free-of-charge platform for open data dedicated to electricity system researchers.

Repository Website Case Study

tableschema-pandas-py

Table Schema to Pandas module for jsontableschema-py.

Repository

BIML-enabled Tabular Data Package Importer

BIML (Business Intelligence Markup Language) is a project that uses datapackage.json to generate SSIS packages that can load the contents of a Tabular Data Package into a SQL Server database.

Repository

Data Quality Dashboard

Data Quality Dashboards display statistics on a collection of published data.

Repository

Open Refine 3.0

OpenRefine is a powerful tool for working with messy data. v3 includes support of Data Package metadata standards to describe and package a collection of data.

Repository Blog Website

Mira

Create simple APIs from CSV files.

Repository

tableschema-biqquery-py

Table Schema to BigQuery module for tableschema-py.

Repository

the-el

Command-line tool by the City of Philadelphia to extract and load SQL tables using Table Schema, complete with Carto support.

Repository

Kaggle API

The Kaggle API follows the Data Package specification for specifying metadata when creating new Datasets and Dataset versions.

Repository Wiki Website

Dataship

Dataship is a way to share data and analysis, from simple charts to complex machine learning, with anyone in the world easily and for free.

Repository Website Case Study

data-cli

command line tool for working with Data Packages.

Website Repository

datapackage-go

A Go library for working with Data Packages.

Repository

datapackage-clj

A Clojure library for working with Data Packages.

Repository

tableschema-sql-py

Table Schema to SQL module for tableschema-py.

Repository

Data Package Creator

A web service for creating Data Packages.

Website Repository

csvlint-rb

A ruby gem to support validating CSV files to check their syntax and contents.

Repository

CSV Lint

CSV Lint is a webservice for validating tabular data.

Website Repository

datapackage-r

An R library for working with Data Packages.

Repository

Data Retriever

The Data Retriever uses the Data Package format internally. It is a package manager for data. It downloads, cleans, and stores publicly available data, so that analysts spend less time cleaning and managing data, and more time analyzing it.

Website Repository

PostgreSQL Table Schema

Create Table Schema from a live PostgreSQL database.

Repository

Data Factory

An open framework and toolkit for creating data flows to collect, inspect, process and publish data.

Website

datapackage-matlab

a MATLAB function to read data from a Tabular Data Package.

Repository

datapackage-java

A Java library for working with Data Packages.

Repository

SmartCSV.fx

A simple JavaFX application to load, save and edit a CSV file and provide a JSON configuration for columns to check the values in the columns.

Website Repository

tabulator-py

Consistent interface for stream reading and writing tabular data (csv/xls/json/etc).

Repository
bookdocsexternal fforumgithubgitterheartpackageplayrocket softwaretools