rdnet
romain.dorgueil.net
Bookmarks
#data-processing
#etl
Data Processing for Humans
https://bonobo-project.org/
bonobo
data-processing
humans
data-science
data-transformation
library
python
etl
Apache NiFi
https://nifi.apache.org/
data-processing
nifi
example
data
gui
extract-transform-load
etl
apache
Record linkage - Wikipedia
https://en.wikipedia.org/wiki/Record_linkage
datalake
warehouse
nlp
data-processing
etl
data
record-linkage
Dask - dask 0.10.1 documentation
http://dask.readthedocs.io/
data-processing
pydata
tools
processing
python
parallel
dask
etl
distributed
sql
computing
database
distributed-computing
Ecosystem - Blaze 0.10.2rc1+5.g87dd886 documentation
http://blaze.readthedocs.io/
tabular
data-processing
data-science
statistics
pydata
tools
blaze
python
etl
sql
database
Map Reduce for Golang
https://blog.gopheracademy.com/advent-2015/glow-map-reduce-for-golang/
map-reduce
data-processing
golang
big-data
etl
data-integration
glow
Very Large Result Sets in Django using PostgreSQL
http://thebuild.com/blog/2010/12/13/very-large-result-sets-in-django-using-postgresql/
data-processing
psycopg2
big-data
etl
django
postgresql
cursor
OpenRefine
http://openrefine.org/
refine
data-processing
open-refine
tool
data-science
data
etl
data-integration
google
google-refine
8 results found in 15.84 ms