Type: Feature Request, clustering, help wanted, performance
Description
Hi, I am a research student from the Database Group at Tsinghua University. We have developed a Java clustering library, https://github.com/lispc/EditDistanceClusterer, which is much faster than the simile-vicino library currently used in OpenRefine. I wonder whether it would be possible to integrate the library into OpenRefine. What features, tests, or performance reports would be needed? (This is my first time making an open-source pull request, so I apologize for anything not done correctly.)