Skip to content
forked from MI-DPLA/combine

combine /kämˌbīn/ - Service Hub Metadata Aggregator

License

Notifications You must be signed in to change notification settings

blancoj/combine

 
 

Repository files navigation

🚜 Combine

Overview

Combine is a Django application to facilitate the harvesting, transformation, analysis, and publishing of metadata records by Service Hubs for inclusion in the Digital Public Library of America (DPLA).

The name "Combine", pronounced /kämˌbīn/, is a nod to the combine harvester used in farming used to, "efficiently harvest a variety of grain crops." Instead of grains, we have metadata records! These metadata records may come in a variety of metadata formats, various states of transformation, and may or may not be valid in the context of a particular data model. Like the combine equipment used for farming, this application is designed to provide a single point of interaction for multiple steps along the way of harvesting, transforming, and analyzing metadata in preperation for inclusion in DPLA.

Documentation

See the docs folder for more detailed documentation.

Also available, a "QuickStart" walkthrough.

Installation

Combine has a fair amount of server components, dependencies, and configurations that must be in place to work, as it leverages Apache Spark, among other applications, for processing on the backend.

To this end, a separate GitHub repository, Combine-playbook, has been created to assist with provisioning a server with everything neccessary, and in place, to run Combine. This repository provides routes for server provisioning via Vagrant and/or Ansible. Please visit the Combine-playbook repository for more information about installation.

About

combine /kämˌbīn/ - Service Hub Metadata Aggregator

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 72.7%
  • Python 13.8%
  • HTML 5.8%
  • CSS 5.8%
  • XSLT 1.9%