1
0
mirror of https://github.com/vimagick/dockerfiles.git synced 2024-12-23 01:39:27 +02:00
dockerfiles/openrefine/reconcile-csv/README.md
2016-07-06 18:41:50 +08:00

968 B

reconcile-csv

Reconcile-csv is a reconciliation service for OpenRefine running from a CSV file. It uses fuzzy matching to match entries in one dataset to entries in another dataset, helping to introduce unique IDs into the system - so they can be used to join your data painlessly.

docker-compose.yml

reconcile-csv:
  image: vimagick/openrefine-reconcile-csv
  ports:
    - "8000:8000"
  volumes:
    - ./data:/data
  environment:
    - JAVA_OPTS=-Xmx2g
    - CSV_FILE=input.csv
    - SEARCH_COLUMN=name
    - ID_COLUMN=id
  restart: always

input.csv

id,name
1,kevin
2,tom
3,sarah
4,mike
5,lucy

up and running

$ docker-compose up -d
$ curl http://localhost:8000/reconcile?query=kev
$ curl http://localhost:8000/reconcile?query={%22query%22:%22kev%22,%22limit%22:1}
$ curl http://localhost:8000/view/1