Package: blocking Type: Package Title: Various Blocking Methods for Entity Resolution Version: 1.0.3 Authors@R: c(person(given = "Maciej", family = "Beręsewicz", role = c("aut", "cre"), email = "maciej.beresewicz@ue.poznan.pl", comment = c(ORCID = "0000-0002-8281-4301")), person(given = "Adam", family = "Struzik", role = c("aut", "ctr"), email = "adastr5@st.amu.edu.pl")) Description: The goal of 'blocking' is to provide blocking methods for record linkage and deduplication using approximate nearest neighbour (ANN) algorithms and graph techniques. It supports multiple ANN implementations via 'rnndescent', 'RcppHNSW', 'RcppAnnoy', and 'mlpack' packages, and provides integration with the 'reclin2' package. The package generates shingles from character strings and similarity vectors for record comparison, and includes evaluation metrics for assessing blocking performance including false positive rate (FPR) and false negative rate (FNR) estimates. For details see: Papadakis et al. (2020) , Steorts et al. (2014) , Dasylva and Goussanou (2021) , Dasylva and Goussanou (2022) . License: GPL-3 Encoding: UTF-8 LazyData: true URL: https://github.com/ncn-foreigners/blocking, https://ncn-foreigners.ue.poznan.pl/blocking/ BugReports: https://github.com/ncn-foreigners/blocking/issues Roxygen: list(markdown = TRUE) Imports: text2vec, tokenizers, RcppHNSW, RcppAnnoy, mlpack, rnndescent, igraph, data.table, methods, readr, utils, Matrix Suggests: tinytest, knitr, rmarkdown, reclin2 VignetteBuilder: knitr Depends: R (>= 4.1.0) Config/roxygen2/version: 8.0.0 Config/pak/sysreqs: libglpk-dev libicu-dev libxml2-dev libx11-dev Repository: https://ncn-foreigners.r-universe.dev Date/Publication: 2026-06-14 07:57:32 UTC RemoteUrl: https://github.com/ncn-foreigners/blocking RemoteRef: HEAD RemoteSha: 6cfda9bab507d91c5e7a852b321796fde5a1d367 NeedsCompilation: no Packaged: 2026-06-14 09:40:55 UTC; root Author: Maciej Beręsewicz [aut, cre] (ORCID: ), Adam Struzik [aut, ctr] Maintainer: Maciej Beręsewicz