rdnet
romain.dorgueil.net
Bookmarks
#etl
Launch - AWS Glue Now Generally Available
https://aws.amazon.com/blogs/aws/launch-aws-glue-now-generally-available/?sc_channel=sm&sc_campaign=Serverless%2CAWS_Blog&sc_publisher=TWITTER&sc_country=Global&sc_geo=GLOBAL&sc_outcome=awareness&trk=_TWITTER&sc_content=blog_top_posts_aug_2A&sc_category=AWS_Glue&linkId=42400853
aws
serverless
etl
Engineers Shouldn’t Write ETL: A Guide to Building a High Functioning Data Science Department
http://multithreaded.stitchfix.com/blog/2016/03/16/engineers-shouldnt-write-etl/
engineering
data-science
etl
Data Processing for Humans
https://bonobo-project.org/
bonobo
data-processing
humans
data-science
data-transformation
library
python
etl
Apache NiFi
https://nifi.apache.org/
data-processing
nifi
example
data
gui
extract-transform-load
etl
apache
pawl/awesome-etl: A curated list of awesome ETL frameworks, libraries and software.
https://github.com/pawl/awesome-etl
etl
libraries
awesome
extract-transform-load
list
tools
python
Record linkage - Wikipedia
https://en.wikipedia.org/wiki/Record_linkage
datalake
warehouse
nlp
data-processing
etl
data
record-linkage
Dask - dask 0.10.1 documentation
http://dask.readthedocs.io/
data-processing
pydata
tools
processing
python
parallel
dask
etl
distributed
sql
computing
database
distributed-computing
Ecosystem - Blaze 0.10.2rc1+5.g87dd886 documentation
http://blaze.readthedocs.io/
tabular
data-processing
data-science
statistics
pydata
tools
blaze
python
etl
sql
database
access APIs and webservices from your own tools - Blockspring
https://www.blockspring.com/
google-spreadsheet
blockspring
data
api
analytics
excel
automation
etl
spreadsheet
Map Reduce for Golang
https://blog.gopheracademy.com/advent-2015/glow-map-reduce-for-golang/
map-reduce
data-processing
golang
big-data
etl
data-integration
glow
13 results found in 21.51 ms