User Interface for Deduplicating Records



  • I am looking for UX interface patterns that can be used to dedupe records.

    Suppose, for example, one has a database of customers. Perhaps Ronald T. Wiggins and Ronald Wiggins are both in the database and have the same address. Users would like to be able to merge those two records into one canonical record.

    One pattern that facilitates record merging is provided by OpenRefine, the open source project by Google for data cleaning. We can see below various "clusters" where each cluster contains items that a user can merge into a single canonical record:

    enter image description here

    In my PhD research I was working on identifying visual copyright infringement, and rigged up a basic UI that let me identify whether or not images match and should be clustered:

    enter image description here

    Has anyone encountered interfaces that facilitate this kind of record merging? Any examples others can offer would be hugely helpful!



  • Most commonly seen example I can think of is the https://www.technipages.com/excel-highlight-duplicate-unique-values feature in Microsoft Excel where you can perform a number of operations on duplicate values depending on https://www.wallstreetmojo.com/highlight-duplicates-in-excel/ the information.

    enter image description here

    enter image description here




Suggested Topics

  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2
  • 2