Clinical Data and AI Integration Analyst

King's College London

Job ID: 120725. Salary: £53,149 – £57,566 per annum, including London Weighting Allowance.

Posted: 23 July 2025. Closing date: 20 August 2025.

Business unit: IoPPN. Department: Biostatistics & Health Informatics.

Contact details: Dr Fernando Lanza / Thomas Searle. fernando.lanza@kcl.ac.uk / thomas.searle@kcl.ac.uk

Location: Denmark Hill Campus. Category: Research.

About us:

CogStack ( https://cogstack.org/) is an award-winning ecosystem of tools and workflows that facilitate the ingestion, structuring, organising and visualisation of Electronic Health Record (EHR) data built by a multidisciplinary team of software developers, machine learning engineers, clinical researchers and health informaticians.

The CogStack team is at the forefront of building impactful solutions and partnering with NHS Trusts and healthcare providers, tackling real-world clinical problems, supporting use cases from state-of-the-art clinical research through to translational research, delivering innovative solutions for direct patient care (How Elastic improves patient outcomes with valuable healthcare data;  https://doi.org/10.1101/123299).

The CogStack team benefits from sitting within a leading programme of clinical, health and bioinformatics at the South London and Maudsley (SLaM) Biomedical Research Centre (BRC) and forms a key component of both the Centre for Translational Informatics (www.ctiuk.org) and actionable analytics theme of the recently awarded Health Data Research UK (HDR UK) London site.

Major funding has been awarded by the Office for Life Sciences, InnovateUK and recently a Stage 3 AI for Health and Social Care Award from NHSx. The ecosystem has already been recognised in Government reports to the Chief Medical Officer, NHSx AI report, NHS Tech Plan and keynote speeches by the Health Secretary.

About the role:

The Clinical Data Linkage Service (CDLS), hosted by the NIHR Maudsley Biomedical Research Centre (BRC), provides secure and ethical linkage between datasets from King’s College Hospital (KCH), Guy’s and St Thomas’ NHS Foundation Trust (GSTT), and the Clinical Record Interactive Search (CRIS) platform at South London and Maudsley NHS Foundation Trust (SLaM).

The postholder will support and maintain the use of CogStack to extract and process clinical data at KCH and GSTT for linkage to CRIS via the CDLS. This includes working closely with CogStack colleagues, data controllers, research teams and operational stakeholders across Trusts to ensure high-quality, auditable and timely data processing pipelines.

The post holder will be expected to be able to contribute to the following areas:

  • Operational support for running and maintaining CogStack pipelines at KCH and GSTT, with a focus on secure, high-quality, and auditable EHR data extraction for the CDLS.
  • Implementation and documentation of ETL workflows to map data to CDLS and CRIS data structures.
  • Contribution to the technical specification and troubleshooting of issues arising in clinical data provisioning, NLP processing, or linkage preparation.
  • Extension of CogStack-NiFi or other internal modules for custom data routing, transformation or enrichment tasks (e.g. MedCAT NER+L).
  • Data quality assessment and contribution to the development of automated or manual quality control tools for clinical datasets.
  • Communication of requirements and constraints to clinical and non-technical audiences, especially in relation to information governance and linkage protocols.
  • Collaboration with the broader CogStack team to ensure architectural alignment, reusable components, and long-term platform sustainability.

This is a full-time post (35 hours per week), and you will be offered a fixed-term contract until 30/11/2027.

About you:

To be successful in this role, we are looking for candidates to have the following skills and experience:

Essential criteria

  1. MSc or equivalent experience in a relevant area such as computer science, health informatics, software engineering, or data science
  2. Experience with Python and/or Java for data engineering or EHR processing tasks
  3. Experience with ETL workflows, database systems (e.g. PostgreSQL, SQL Server), and API-based data integration
  4. Knowledge of data security, audit logging and information governance in healthcare settings
  5. Experience working with version control (e.g. Git), DevOps practices and container technologies such as Docker
  6. Strong communication skills and ability to work across multi-disciplinary and cross-organisational teams

Desirable criteria

  1. Experience working in or with NHS Trusts or other health data environments
  2. Experience with MedCAT, CogStack-NiFi or similar clinical NLP tools
  3. Understanding of CRIS or similar de-identified research platforms
  4. Experience managing complex or high-volume data pipelines in production environments
  5. Knowledge of FHIR, SNOMED CT or other healthcare interoperability and ontology standards

Downloading a copy of our Job Description

Full details of the role and the skills, knowledge and experience required can be found in the Job Description document, provided at the bottom of the page. This document will provide information on the criteria that will be assessed at each stage of the recruitment process.

Further information:

We ask all candidates to submit a copy of their CV and a supporting statement detailing how they meet the essential criteria listed in the advert. If we receive a strong field of candidates, we may use the desirable criteria to choose our final shortlist, so please include your evidence against these where possible.

To find out how our managers will review your application, please take a look at our ‘How we Recruit’ pages.

Similar jobs

View more jobs