​Frequently Asked Questions (FAQs)

What is the Data Innovation Office? 

AKU has invested in the creation of the Data Innovation Office that is focused on supporting research projects that employ data intensive methodologies. The CDIO deploys all projects on a modern data stack utilizing cloud technologies and open-source algorithms. A key mandate for the CDIO office is to leverage data in innovative ways to address population health challenges in East Africa thereby leapfrogging over many traditional Western methodologies. 

What does the Data Innovation Office (DIO) do? 

DIO Office – What We Do & Our Key Services and Projects

  • Data Repositories: Centralized access to organized internal and external datasets for research. We have an EHR population health data repository including 3.4M unique patient records from primary and secondary care facilities across Nairobi, Mombasa, and Kisumu over 10+ years.

  • ​Decision Support Systems: Empowering AKU's research and academic leaders with actionable insights.

  • Operational Efficiency: We offer streamlined solutions and optimized workflows through systems and automation to enhance productivity and reduce manual processes such as Library Information Management Systems and automated academic paper scraping.​

  • Data Infrastructure Solutions: Cloud-based data management and architecture services - we deploy all our projects on a modern data stack; we are Kenya’s first academic research project leveraging cloud and open-source technologies.

  • AI Research Hub (Innovation and Research advancement): Guidance, support and resources to data-intensive research projects for AKU in Kenya and leverage emerging technologies, such as synthetic data, machine learning, large language models (LLMs), and generative AI.

  • Capacity Building: Training sessions, webinars, and workshops designed to enhance data literacy and skills in research. (AKU Data Innovators Club​)

  • Data Protection and Compliance: We offer expertise and consultation in data protection regulations and processes.

  • Communication And Data Storytelling: Support for disseminating research through innovative communication strategies beyond traditional channels – social media, data storytelling.

  • Curated Data Events: Events designed to foster networking, collaboration, and knowledge exchange among researchers and practitioners.

What should the AKU community and external partners expect from you?

  • ​Proven working formula for establishing data infrastructure for research projects.

  • Streamlined processes and cost savings vs what they currently use.

  • Access to Diverse Data Sources: Utilize a vast repository of medical records for research purposes.

  • High-Performance Computing Resources: Access infrastructure for running complex AI algorithms compliant with regulatory standards.

  • Mentorship and Collaboration: Engage in mentorship programs, collaborative workspaces, and interdisciplinary projects.​

What are the key tenants for the Data Innovation Office?

  • We embrace a 'cloud-first, open-source' architecture approach, ensuring efficient and scalable solutions.

  • We harness cutting-edge technologies—including generative AI, scalable cloud computing, and open-source platforms—to deliver high-performance, accessible, and research-ready solutions tailored to the unique needs of our academic community

  • We remain agile, constantly iterating and adapting to emerging open-source algorithms, standards, and methodologies.

  • We have a proven ability to deliver high-impact solutions within the resource constraints of LMICs—maximizing value through cost-effective, purpose-driven strategies.

  • We uphold the highest ethical and regulatory standards, ensuring full compliance with data protection laws and country-specific regulations.

  • We work in small cross functional teams - collaborations with multi-disciplinary, multi-functional expertise.

  • We ensure open access and collaboration to foster a collaborative environment where researchers, industry professionals, and policymakers can share knowledge and expertise.

  • We focus on Sub-Saharan Africa, prioritizing research that addresses the unique challenges and opportunities of East Africa.

  • We nurture vibrant knowledge ecosystems by organizing conferences, workshops, and events that bring together researchers, industry professionals, and policymakers. Our multi-disciplinary teams play a key role in shaping these engagements, ensuring that diverse perspectives inform collaborative problem-solving and drive innovation at the intersection of multiple sectors.

What is an example of your office supporting researchers?

  1. A researcher wants to undertake a study on prevalence of heart attacks amongst elderly population in the urban areas of Kenya. 
  2. Data Source: We provide anonymized medical records from our health data repository to support their grant application.
  3. Data Preparation: Our data engineers clean and process the data to make it ready for analysis.
  4. Data Infrastructure: The Data Innovation Office establishes systems for storage and processing of data to make sure they’re compatible and compliant with data privacy regulations.
  5. Consultation and training: We advice on applicable AI models that can be used for predictive analysis
  6. Privacy and Ethics: We ensure compliance with data privacy regulations and ethical guidelines throughout the pipeline.​

I hear the Data Innovation Office supports the UZIMA-DS​ project – in what capacity?

The Data Innovation team plays a vital role in supporting the data infrastructure needs of the UZIMA-DS project. Operating within a cloud-first, open-source environment, the UZIMA-DS architecture undergoes continuous iteration, incorporating emerging data algorithms, best practices, and standards. The team's primary responsibility is to ensure that all data required by researchers undergoes processing and cleansing through high-quality data pipelines and is then organized within a data model optimized​ for analysis. Additionally, the Data Innovation team provides essential training and support to researchers on accessing cloud resources and utilizing tools effectively. Furthermore, the team ensures compliance with Kenyan Data Protection laws and serves as the primary contact with the Office of the Data Protection Commissioner.​​