DataCite
Redesigning DataCite's Harvesting Services
Redesigned DataCite's Harvesting Services from the ground up to boost service utilization and unlock new revenue opportunities through user-centered design research.

Overview
Redesigned DataCite's Harvesting Services from the ground up to address underutilization and unlock untapped revenue opportunities.
DataCite's existing Harvester Services were significantly underutilized despite the fact that users of such services are typically willing to pay for them. We recognized an untapped revenue opportunity if we could boost service utilization through better understanding of user needs and improved service design.
The challenge was to understand why the current services weren't meeting user needs and redesign them to be more valuable and accessible to our target audience.
My Role
My Contribution
- UX Research
- Workshop Facilitation
- Product Design
The Team
- 1 x Product Designer
- 1 x Project Manager
- 2 x Developers
Research Methods
- Focus Group (56 participants)
- Design Sprint (8 participants)
- Cognitive Walkthroughs (5 sessions)
- Concept Testing (5 sessions)
Year: 2022
Problem Statement
DataCite's Harvesting Services faced several critical challenges:
- Low utilization rates despite market demand for harvesting services
- Unclear value proposition for potential paying customers
- Limited understanding of user needs and preferences
- Technical constraints that didn't align with user workflows
- Missed revenue opportunities in a market willing to pay for quality services
The existing service architecture and user experience weren't meeting the diverse needs of organizations looking to harvest research data and metadata.
Research Methodology
Focus Group Study
We initiated the project with a comprehensive focus group study involving representatives from 10 major organizations. This exploratory approach allowed us to understand their needs in terms of harvesting services and their willingness to pay for such services.
The focus group provided critical insights into:
- Current pain points with existing harvesting solutions
- Preferred data formats and delivery methods
- Pricing expectations and budget constraints
- Technical requirements and integration needs
Design Sprint Approach
A design sprint was subsequently conducted to generate a comprehensive initial prototype for the new harvesting service. This sprint involved around 7 internal collaborators from different departments and was facilitated virtually using Miro due to the pandemic.
The aim was to ensure that the service redesign would meet the needs of all organizational corners. To foster a long-term vision, we slightly altered the design sprint's activity order, positioning the mapping activity as a secondary step.
Design Sprint Process
Day 1: Expert Interviews & How Might We
I prepared a bespoke questionnaire to guide participants during the "expert interviews" activity of the design sprint, which led to 83 "how might we" notes. This comprehensive collection of insights formed the foundation for our ideation process.
Days 2-4: Mapping, Sketching, and Prototyping
Over the next three days, I guided the participants through each stage of the design sprint, focusing significant effort on the mapping exercise and sketching day preparation.
Two top-voted sketches were selected for further prototyping in Figma, followed by an Expert Walkthrough validation with eight users to collect further feedback.
Key Findings
Through our research methodology, we identified six critical requirements for the redesigned harvesting service:
Format Requirements
Both XML and JSON need to be provided for different use cases, as organizations have varying technical infrastructures and preferences.
Frequency Needs
Monthly and daily harvesting are of the essence for most users, with some requiring even more frequent updates for time-sensitive applications.
Metadata Specifications
Most cases will need only metadata but citation data could be packaged separately, allowing for more flexible and cost-effective service tiers.
Business Model
A tiered freemium approach is possible with a Premium service specifically designed for for-profit organizations willing to pay for enhanced features.
Integration Requirements
Tight integration with the DataCite CRM will be needed to provide seamless user management and billing capabilities.
Snapshot Services
For-profits would prefer snapshots due to the high demands on frequency, providing complete datasets at specific points in time rather than incremental updates.
Solution Architecture
Based on our research findings, we developed a comprehensive solution that addressed all identified user needs:
Tiered Service Model
- Basic Tier: Free access to essential metadata harvesting
- Premium Tier: Enhanced features for commercial users including higher frequency updates, priority support, and additional data formats
Technical Implementation
- Multi-format support for both XML and JSON outputs
- Flexible frequency options from daily to real-time updates
- Snapshot capabilities for complete dataset downloads
- API-first architecture for seamless integration
User Experience Improvements
- Simplified onboarding process for new users
- Self-service portal for managing harvesting preferences
- Clear documentation and support resources
- Transparent pricing and service level agreements
Impact and Results
The implementation of our research-driven recommendations delivered significant business impact:
42% Reduction in Lead Time
The findings scoped the design work sufficiently to start working on the MVP earlier and faster, accelerating time-to-market for the improved service.
New Customer Relationships
The implementation of these recommendations led to the establishment of relationships with four potential paying users of the harvesting service, representing significant revenue opportunities.
Immediate User Adoption
The introduction of the snapshot feature allowed 12 new users to take advantage of the service right away, demonstrating immediate market validation.
Stakeholder Testimonial
"When we were involved in the design sprint for the DataCite harvesting service, the 'How Might We' question technique supported creativity while focusing on the problem to solve for the user."
Britta Dreyer
Head of PID and Metadata Services, Technische Informationsbibliothek (TIB)
Insights and Recommendations
Our research led to three key strategic recommendations:
-
Define a tiered approach for the harvesting service, including a basic service for users requiring only citation metadata and a premium service for others.
-
Modify the tech stack and user profile service to accommodate the new premium tier users and their specific requirements.
-
Scale the infrastructure to address the increased needs for higher frequency harvesting and larger data volumes.
Conclusion
The DataCite Harvesting Services redesign project demonstrates the power of user-centered design research in identifying and addressing service utilization challenges. By conducting comprehensive focus groups and design sprints, we were able to uncover specific user needs that weren't being met by the existing service.
The project successfully transformed an underutilized service into a viable revenue opportunity while better serving the research community's data harvesting needs. The tiered approach and technical improvements provide a sustainable foundation for future growth and service expansion.