Golf ball, P. (2000). In the P. Basketball, H. F. Spirer, & L. Spirer (Eds.), Deciding to make the Instance: Exploring Large scale Person Rights Abuses Using Advice Expertise and you will Studies Analysis. AAAS.
Belin, T. Roentgen., & Rubin, D. B. (1995). A method to possess calibrating not true-fits prices into the checklist linkage. Record of one’s Western Mathematical Connection, 90(430), 694–707.
Bilenko, M., & Mooney, Roentgen. J. (2003). Adaptive Duplicate Recognition Using Learnable Sequence Similarity Methods. In the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated List Linkage Playing with Seeded Nearest Neighbour and you will Service Vector Machine Class. Inside the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A study from indexing strategies for scalable record linkage and you can deduplication. IEEE Deals into Training and you can Studies Systems, 24(9), 1537–1555.
Cohen, W., Raviku). An assessment regarding sequence metrics to own coordinating brands and info. For the KDD workshop into data cleanup and object combination (Vol. 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Checklist linkage: Statistical activities having coordinating computer records. Diary of the Royal Statistical Neighborhood, Series A great, 153(3), 287–320.
Dai, An effective. Meters., & Storkey, A good. J. (2011). The categorized publisher-situation design to have unsupervised entity solution. When you look at the Fake neural sites and server learning–icann 2011 (pp. 241–249). Springer.
Fortini, Yards., Liseo, B. silverdaddies Mail prijava, Nuccitelli, A beneficial., & Scanu, Yards. (2001). Towards the Bayesian Record Linkage. Lookup within the Certified Statistics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, A beneficial. (2013). A beneficial bayesian means of document hooking up to analyze prevent- of-lives scientific will cost you. Record of your Western Statistical Organization, 108(501), 34–47.
Hsu, W., Lee, Yards. L., Liu, B., & Ling, T. W. (2000). Mining Exploration when you look at the Diabetics Databases: Results and you will Conclusions. Into the KDD ’00 (pp. 430–436). ACM.
A torn-combine Markov chain Monte Carlo process of the Dirichlet techniques mix design
Jewell, N. P., Spagat, M., & Jewell, B. L. (2013). MSE and you can Casualty Counts: Presumptions, Interpretation, and you can Demands. During the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civil Casualties: An overview of Recording and Estimating Nonmilitary Fatalities in conflict. Oxford, UK: Oxford University Force.
Larsen, Yards. D. (2002)ments into the Hierarchical Bayesian Listing Linkage. Inside Procedures of your own mutual statistical conferences, section into questionnaire lookup tips (pp. 1995–2000). The newest American Mathematical Association.
Steorts, R
Larsen, M. D. (2005). Improves in List Linkage Concept: Hierarchical Bayesian Record Linkage Concept. From inside the Process of the mutual statistical meetings, point on the survey look steps (pp. 3277–3284). The newest American Analytical Relationship.
Larsen, M. D., & Rubin, D. B. (2001). Iterative automated list linkage using mix models. Log of the Western Statistical Connection, 96(453), 32–41.
Lum, K., Price, M. Elizabeth., & Banks, D. (2013). Applications regarding Several Solutions Quote from inside the Individual Legal rights Search. The fresh new Western Statistician, 67(4), 191–two hundred.
Marchant, Letter. Grams., C., Kaplan, Good., Rubinstein, B. I. P., & Elazar, D. Letter. (2019). D-blink: Marketed avoid-to-avoid bayesian organization quality.
McCallum, Good., & Wellner, B. (2004). Conditional Different types of Title Suspicion with Application so you’re able to Noun Coreference. In Enhances during the sensory suggestions running options (nips ’04) (pp. 905–912). MIT Drive.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A domain name-Specific Product to your Deduplication from Vaccination Records Facts in Young people Immunization Registriesputers and you will Biomedical Browse, 33(2), 126–143.
Murphy, J., Brackbill, R. Meters., Thalji, L., Dolan, Yards., Pulliam, P., & Walker, D. J. (2007). Computing and you will Increasing Exposure around the globe Change Cardiovascular system Wellness Registry. Analytics in Medication, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic record linkage and you can deduplication immediately after indexing, blocking, and you will filtering. Diary out of Confidentiality and you may Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, An effective. P. (1959). Automatic linkage of public record information computers can be used to pull” follow-up” statistics off family members off records from techniques information. Research, 130(3381), 954–959.
Sadinle, Meters. (2014). Detecting Duplicates from inside the a murder Registry Using an excellent Bayesian Partitioning Means. Annals out of Applied Analytics, 8(4), 2404–2434.
Sariyar, Yards., Borg, An excellent., & Pommerening, K. (2012). Effective Studying Strategies for the newest Deduplication regarding Electronic Diligent Study Using Category Woods. Record regarding Biomedical Informatics, 45(5), 893–900.
C., Hall, R., & Fienberg, S. Elizabeth. (2016). An excellent Bayesian Method of Graphical Number Linkage and you may Deduplication. Record of your own American Analytical Relationship, 111(516), 1660–1672.
Tancredi, A good., & Liseo, B. (2011). A beneficial hierarchical Bayesian way of listing linkage and you will populace size troubles. Annals off Applied Analytics, 5(2B), 1553–1585.