OpenRefine/OpenRefine

Integrate PassJoin cluster library

Open

#983 建立於 2015年4月21日

在 GitHub 查看
 (16 留言) (1 反應) (0 負責人)Java (10,056 star) (1,891 fork)batch import
Type: Feature Requestclusteringhelp wantedperformance

描述

Hi I am a research student from Database Group , Tsinghua University and we have developed a java clustering library https://github.com/lispc/EditDistanceClusterer which is much faster than the current simile-vicino used in OpenRefine. I wonder whether it is possible to integrate the lib to OpenRefine. What features / tests / performance reports are needed ? (First time to do a open-source pull request, sorry for anything not corretly done)

貢獻者指南