1
0
mirror of https://github.com/vimagick/dockerfiles.git synced 2024-11-24 08:52:31 +02:00
dockerfiles/openrefine/reconcile-csv
2017-05-08 07:05:07 +08:00
..
data add reconcile-csv 2016-07-06 18:41:50 +08:00
docker-compose.yml add reconcile-csv 2016-07-06 18:41:50 +08:00
Dockerfile swith to new domain: easypi.pro 2017-05-08 07:05:07 +08:00
README.md add reconcile-csv 2016-07-06 18:41:50 +08:00

reconcile-csv

Reconcile-csv is a reconciliation service for OpenRefine running from a CSV file. It uses fuzzy matching to match entries in one dataset to entries in another dataset, helping to introduce unique IDs into the system - so they can be used to join your data painlessly.

docker-compose.yml

reconcile-csv:
  image: vimagick/openrefine-reconcile-csv
  ports:
    - "8000:8000"
  volumes:
    - ./data:/data
  environment:
    - JAVA_OPTS=-Xmx2g
    - CSV_FILE=input.csv
    - SEARCH_COLUMN=name
    - ID_COLUMN=id
  restart: always

input.csv

id,name
1,kevin
2,tom
3,sarah
4,mike
5,lucy

up and running

$ docker-compose up -d
$ curl http://localhost:8000/reconcile?query=kev
$ curl http://localhost:8000/reconcile?query={%22query%22:%22kev%22,%22limit%22:1}
$ curl http://localhost:8000/view/1