This repo includes the code used for implementing SiMa, as described in "SiMa: Effective and Efficient Matching Across Data Silos Using Graph Neural Networks".
src
Contains python source files used for developing SiMa and getting effectiveness results.notebooks
Containsmatching_data_silos.ipynb
notebook for easily comprehending the pipeline of SiMa. It uses an example of SiMa running on the data silo configuration derived from NYC OpenData (used for the paper). In addition, it shows how SiMa compares with COMA and Starmie in terms of effectiveness.
Datasets and datafiles used can be found here.