Until recently, it was impossible to perform a single search across all the collections held by Yale University’s library and museums. That all changed in 2023 when Yale University publicly launched LUX: Yale Collections Discovery. LUX is a first-of-its-kind cross-collection discovery system that uses a linked data model to enable search and discovery across disparate collecting units. Now, Yale University has shared the open-source GitHub repository for its LUX platform, licensed under Apache 2.0. Yale hopes that sharing the project will enable other institutions with diverse cultural heritage collections to quickly adopt the innovative solution to facilitate cross-collection discovery.
A screenshot of the LUX user interface
The Yale University Library dates back to the founding of Yale College in 1701. Over the centuries, the library has grown significantly, acquiring vast collections and expanding its facilities.
The library boasts a collection of more than 15 million print and electronic resources, including books, journals, manuscripts, maps, photographs and more. Yale University Library's collections cover a wide range of subjects and disciplines, supporting research and learning across the humanities, social sciences, sciences and professional schools. The library system comprises multiple libraries spread across the Yale University campus in New Haven, Connecticut.
However, Yale’s collections reach well beyond the library system:
Yale’s cultural heritage collaboration efforts began over a decade ago, including announcing its Open Access Policy and launching initial cross-collections discovery efforts in 2011. In 2016, Yale appointed Susan Gibbons as its first Vice Provost for Collections and Scholarly Communications, with a focus on fostering connections within Yale’s collecting units. In 2017, a new level of collaboration between central IT and the collecting organizations began with the establishment of the Cultural Heritage pillar. The cross-functional collaboration of this team soon led to a centrally supported project to improve digital access to Yale’s collections.
In 2018 Yale had now begun its 5-year effort to transform cross-collection discovery. A central challenge was that each collecting unit (library, museum, gallery, archive, etc.) has its own collections management system. Descriptive practices vary between museum, archive and library collections. They can also differ across domains. For this reason, each unit has its own purpose-built search application tailored to their audience. Therefore, to search across Yale’s collections for a given interest area, a user might have to search four or more different websites. The team envisioned a rich user experience where users could explore diverse collections based on shared concepts and relationships. It became clear to the team at Yale that standard search technology would not be enough. Linked Data—and a Graph Database to support it—would be a key enabler to drive the user experience. To help drive the Linked Data strategy, Yale hired Rob Sanderson as Senior Director of Cultural Heritage in 2020.
As the project moved into architecture design, the technical requirements now included a document database, search engine and graph database. Yale explored two pathways: the multi-system integration approach and the multi-model database approach. In the multi-system approach, Yale would be responsible for selecting multiple technologies and performing the systems integration needed for the technologies to perform according to the LUX use case. In a multi-model database approach, Yale would choose a technology that natively supports text, document and graph with a single backend. In 2021, following a proof-of-concept phase featuring several search engines and graph databases, Yale chose the Progress MarkLogic Enterprise Data Platform due to its native multi-model database and search capabilities. With this approach, Yale saw an opportunity to focus more time on frontend development and less time on backend integration tasks.
The MarkLogic platform now serves as the Yale LUX backend, enabling advanced data exploration in a knowledge graph model. For more detail on the origins of CHIT and Cross-Collection Discovery at Yale, watch this presentation from Larry Gall of the Yale Peabody Museum.
The LUX Project went live to the public in June of 2023. Yale’s LUX website describes the platform as follows:
“LUX is a digital platform that enables users to search across Yale’s museums, archives, and library collections. This includes works of art, archives, scientific specimens, and other cultural heritage items held at the University. Since its foundation in the early 18th century, the University has encouraged, accumulated, and actively built its collections in the service of its educational mission. LUX supports this mission by making many of these objects and resources accessible so that new knowledge and understanding can be built by those within and beyond Yale.”
Rob Sanderson, Yale’s Senior Director For Cultural Heritage, had the following to share with Yale Daily News: “The power of LUX comes from its ability to allow users to uncover hidden relationships between objects, from shared concepts to famous figures in Yale’s collections…this groundbreaking tool brings together all the university’s collections through modern technology that will enrich discovery opportunities for users, making it easier for them to find what they’re looking for and explore new objects.”
“We think LUX is game-changing for the cultural heritage domain,” he added. “No other institution has a system with this level of capability.”
Now, Yale is sharing the code of its LUX discovery platform to give more cultural guardians a blueprint for opening their collections to the public. This way, more of the treasure trove that is the world’s finest art and body of literature can be experienced at our fingertips. Imagine, you could enter the courts of the wise man, commune with the brightest minds and tap into the wealth of generational knowledge carefully collected and preserved over the years.
To learn more, you can explore LUX for yourself and view our webinar with LUX stakeholders.
For questions, reach out to James Cahill at james.cahill@progress.com
James Cahill is a MarkLogic Account Executive at Progress supporting government and education customers. He has supported MarkLogic since 2016 working with customers large and small to solve their complex data challenges. He holds an undergraduate degree in economics from University of Michigan - Ann Arbor, and is native to the Washington, D.C. region.
Let our experts teach you how to use Sitefinity's best-in-class features to deliver compelling digital experiences.
Learn MoreSubscribe to get all the news, info and tutorials you need to build better business apps and sites