Ball, P. (2000). In P. Ball, H. F. Spirer, & L. Spirer (Eds.), Making the Case: Investigating Large Scale Human Rights Violations Using Information Systems and Data Analysis. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A method for calibrating false-match rates in record linkage. Journal of the American Statistical Association, 90(430), 694–707.
Bilenko, M., & Mooney, R. J. (2003). Adaptive Duplicate Detection Using Learnable String Similarity Measures. In KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automatic Record Linkage Using Seeded Nearest Neighbour and Support Vector Machine Classification. In KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering, 24(9), 1537–1555.
Cohen, W., Ravikumar, P., & Fienberg, S. (2003). A comparison of string metrics for matching names and records. In KDD workshop on data cleaning and object consolidation (Vol. 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Record linkage: Statistical models for matching computer records. Journal of the Royal Statistical Society, Series A, 153(3), 287–320.
Dai, A. M., & Storkey, A. J. (2011). The grouped author-topic model for unsupervised entity resolution. In Artificial neural networks and machine learning–ICANN 2011 (pp. 241–249). Springer.
Fortini, M., Liseo, B., Nuccitelli, A., & Scanu, M. (2001). On Bayesian Record Linkage. Research in Official Statistics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, A. (2013). A Bayesian procedure for file linking to analyze end-of-life medical costs. Journal of the American Statistical Association, 108(501), 34–47.
Hsu, W., Lee, M. L., Liu, B., & Ling, T. W. (2000). Exploration Mining in Diabetic Patients Databases: Findings and Conclusions. In KDD ’00 (pp. 430–436). ACM.
A split-merge Markov chain Monte Carlo procedure for the Dirichlet process mixture model.
Jewell, N. P., Spagat, M., & Jewell, B. L. (2013). MSE and Casualty Counts: Assumptions, Interpretation, and Challenges. In T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civilian Casualties: An Introduction to Recording and Estimating Nonmilitary Deaths in Conflict. Oxford, UK: Oxford University Press.
Larsen, M. D. (2002). Comments on Hierarchical Bayesian Record Linkage. In Proceedings of the joint statistical meetings, section on survey research methods (pp. 1995–2000). The American Statistical Association.
Larsen, M. D. (2005). Advances in Record Linkage Theory: Hierarchical Bayesian Record Linkage Theory. In Proceedings of the joint statistical meetings, section on survey research methods (pp. 3277–3284). The American Statistical Association.
Larsen, M. D., & Rubin, D. B. (2001). Iterative automated record linkage using mixture models. Journal of the American Statistical Association, 96(453), 32–41.
Lum, K., Price, M. E., & Banks, D. (2013). Applications of Multiple Systems Estimation in Human Rights Research. The American Statistician, 67(4), 191–200.
Marchant, N. G., Steorts, R. C., Kaplan, A., Rubinstein, B. I. P., & Elazar, D. N. (2019). d-blink: Distributed end-to-end Bayesian entity resolution.
McCallum, A., & Wellner, B. (2004). Conditional Models of Identity Uncertainty with Application to Noun Coreference. In Advances in neural information processing systems (NIPS ’04) (pp. 905–912). MIT Press.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A Domain-Specific Tool for the Deduplication of Vaccination History Records in Childhood Immunization Registries. Computers and Biomedical Research, 33(2), 126–143.
Murphy, J., Brackbill, R. M., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Measuring and Maximizing Coverage in the World Trade Center Health Registry. Statistics in Medicine, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic record linkage and deduplication after indexing, blocking, and filtering. Journal of Privacy and Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, A. P. (1959). Automatic linkage of vital records: Computers can be used to extract “follow-up” statistics of families from files of routine records. Science, 130(3381), 954–959.
Sadinle, M. (2014). Detecting Duplicates in a Homicide Registry Using a Bayesian Partitioning Approach. Annals of Applied Statistics, 8(4), 2404–2434.
Sariyar, M., Borg, A., & Pommerening, K. (2012). Active Learning Strategies for the Deduplication of Electronic Patient Data Using Classification Trees. Journal of Biomedical Informatics, 45(5), 893–900.
Steorts, R. C., Hall, R., & Fienberg, S. E. (2016). A Bayesian Approach to Graphical Record Linkage and Deduplication. Journal of the American Statistical Association, 111(516), 1660–1672.
Tancredi, A., & Liseo, B. (2011). A hierarchical Bayesian approach to record linkage and population size problems. Annals of Applied Statistics, 5(2B), 1553–1585.