How to Build a Django App for Manipulating VuFind Records Using PySolr?

I am working on a project where I need to fetch, manipulate, and manage metadata records stored in a VuFind Solr index. The goal is to create a Django-based application that leverages PySolr to:

  • Fetch data from the VuFind Solr core.
  • Search and edit records within the Solr index.
  • Identify and manage duplicates, including: finding duplicate records based on specific fields; merging duplicate records; and standardizing records by normalizing data in selected fields.
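
As a rough illustration of the duplicate-handling steps above, a minimal sketch in plain Python might look like the following. It works on the dictionaries that pysolr returns for each document. The field names `title` and `author`, and the helper names `normalize`, `find_duplicates`, and `merge_records`, are assumptions made for the example, not part of VuFind or pysolr.

```python
import re
import unicodedata
from collections import defaultdict


def normalize(value):
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    if isinstance(value, (list, tuple)):       # multi-valued Solr field
        value = value[0] if value else ""
    text = unicodedata.normalize("NFKD", str(value or ""))
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def find_duplicates(docs, fields=("title", "author")):
    """Group documents whose normalized values match on every given field."""
    groups = defaultdict(list)
    for doc in docs:
        key = tuple(normalize(doc.get(field)) for field in fields)
        if any(key):                           # skip docs with no usable data
            groups[key].append(doc)
    return [group for group in groups.values() if len(group) > 1]


def merge_records(group):
    """Keep the first record and fill its empty fields from the others."""
    merged = dict(group[0])
    for doc in group[1:]:
        for field, value in doc.items():
            if field not in merged or merged[field] in (None, "", []):
                merged[field] = value
    return merged


docs = [
    {"id": "1", "title": "The Élan Vital!", "author": ["Bergson, Henri"]},
    {"id": "2", "title": "the elan vital", "author": ["Bergson, Henri"],
     "publishDate": ["1907"]},
    {"id": "3", "title": "Dune", "author": ["Herbert, Frank"]},
]
duplicate_groups = find_duplicates(docs)
merged = merge_records(duplicate_groups[0])
```

Exact matching on normalized keys is the simplest possible rule; it only catches records that differ in case, accents, or punctuation, so it is a starting point rather than a full deduplication strategy.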

I've written some custom code to interact with VuFind's Solr core, but I am facing challenges with:

  • Efficient querying for large datasets (e.g., searching through millions of records).
  • Implementing robust deduplication logic (considering multiple fields and weighted matching).
  • Updating records in Solr without affecting unrelated data.
  • Designing a user-friendly interface for reviewing, editing, and merging records.
  • Code execution issue: When I run the code, it doesn't execute properly. The browser doesn't open, and I can't figure out what might be going wrong.
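
To make the first three challenges concrete, here is a hedged sketch, not a definitive implementation. It assumes `solr` is a `pysolr.Solr` client, that `id` is the unique key of the core, and that the fields being updated are stored (or have docValues), since Solr atomic updates rebuild the document from stored values. The helper names `iter_all_docs`, `weighted_score`, and `set_fields`, the field names, and the URL in the comment are all hypothetical.

```python
from difflib import SequenceMatcher

# Assumed setup (placeholder URL, adjust to your installation):
#   import pysolr
#   solr = pysolr.Solr("http://localhost:8983/solr/biblio", timeout=30)


def iter_all_docs(solr, query="*:*", fields="id,title,author", rows=1000):
    """Yield every matching document using Solr cursorMark deep paging."""
    cursor = "*"
    while True:
        # cursorMark requires a sort that includes the unique key field.
        results = solr.search(query, fl=fields, rows=rows,
                              sort="id asc", cursorMark=cursor)
        yield from results.docs
        next_cursor = results.nextCursorMark
        if not next_cursor or next_cursor == cursor:
            break                              # same cursor back: no more data
        cursor = next_cursor


def _text(value):
    """Flatten a single or multi-valued field to a lowercase string."""
    if isinstance(value, (list, tuple)):
        value = " ".join(str(v) for v in value)
    return str(value or "").strip().lower()


def weighted_score(a, b, weights):
    """Return a 0..1 similarity, weighting each field's string ratio."""
    total = sum(weights.values())
    score = 0.0
    for field, weight in weights.items():
        left, right = _text(a.get(field)), _text(b.get(field))
        if left and right:                     # missing data contributes 0
            score += weight * SequenceMatcher(None, left, right).ratio()
    return score / total if total else 0.0


def set_fields(solr, doc_id, changes, commit=False):
    """Atomically set only the given fields on one document."""
    doc = {"id": doc_id, **changes}
    updates = {field: "set" for field in changes}
    solr.add([doc], fieldUpdates=updates, commit=commit)
```

A call such as `weighted_score(a, b, {"title": 3, "author": 1})` could then be compared against a threshold to flag candidate pairs for manual review. Comparing every pair across millions of records is too slow, so in practice candidates would first be narrowed down, for example by a shared normalized key.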

Here is the full code file:
