An economical Nanopore sequencing assay for human papillomavirus (HPV) genotyping
Diagnostic Pathology volume 15, Article number: 45 (2020)
Human papillomavirus (HPV) testing has been employed by several European countries to augment cytology-based cervical screening programs. A number of research groups have demonstrated potential utility of next-generation sequencing (NGS) for HPV genotyping, with comparable performance and broader detection spectrum than current gold standards. Nevertheless, most of these NGS platforms may not be the best choice for medium sample throughput and laboratories with less resources and space. In light of this, we developed a Nanopore sequencing assay for HPV genotyping and compared its performance with cobas HPV Test and Roche Linear Array HPV Genotyping Test (LA).
Two hundred and one cervicovaginal swabs were routinely tested for Papanicolaou smear, cobas HPV Test and LA. Residual DNA was used for Nanopore protocol after routine testing. Briefly, HPV L1 region was amplified using PGMY and MGP primers, and PCR-positive specimens were sequenced on MinION flow cells (R9.4.1). Data generated in first 2 h were aligned with reference sequences from Papillomavirus Episteme database for genotyping.
Nanopore detected 96 HPV-positive (47.76%) and 95 HPV-negative (47.26%) specimens, with 10 lacking β-globin band and not further analyzed (4.98%). Substantial agreement was achieved with cobas HPV Test and LA (κ: 0.83–0.93). In particular, Nanopore appeared to be more sensitive than cobas HPV Test for HPV 52 (n = 7). For LA, Nanopore revealed higher concordance for high-risk (κ: 0.93) than non-high risk types (κ: 0.83), and with similar high-risk positivity in each cytology grading. Nanopore also provided better resolution for HPV 52 in 3 specimens co-infected with HPV 33 or 58, and for HPV 87 which was identified as HPV 84 by LA. Interestingly, Nanopore identified 5 additional HPV types, with an unexpected high incidence of HPV 90 (n = 12) which was reported in North America and Belgium but not in Hong Kong.
We developed a Nanopore workflow for HPV genotyping which was economical (about USD 50.77 per patient specimen for 24-plex runs), and with comparable or better performance than 2 reference methods in the market. Future prospective study with larger sample size is warranted to further evaluate test performance and streamline the protocol.
Human papillomavirus (HPV) is generally accepted as the causative agent of cervical cancer (CC) , which was first unmasked by the landmark studies of Meisels and Fortin  and Purola and Savia . Currently, there are 198 reference HPV types listed on Papillomavirus Episteme (PaVE) database, and at least 12 were classified as high-risk by World Health Organization (WHO) International Agency for Research on Cancer (IARC) Monographs Working Group [4,5,6]. HPV testing has been adopted by several European countries for primary CC screening, to augment cytology-based screening programs [7, 8]. A number of HPV assays are available commercially, which are mainly based on direct HPV genome detection, HPV DNA amplification and E6/ E7 mRNA detection . Recent advent of next-generation sequencing (NGS) technologies has facilitated high throughput tools for infectious disease diagnostics and epidemiological research. Several research groups have explored utility of Illumina MiSeq and Ion Torrent platforms for HPV genotyping, with comparable sensitivity to well-established line blot assays and broader detection spectrum [10,11,12]. While the reagent cost is comparable to existing commercial assays for large sample batches, these NGS platforms may not be the best choice for medium sample throughput and laboratories with less resources and space. In this regard, portable Nanopore sequencers may allow more flexibility with shorter sequencing time and lower reagent cost. In light of this, we developed a Nanopore HPV genotyping protocol using 2 published primer sets, and compared its performance with 2 commercial HPV assays: cobas HPV Test and Roche Linear Array HPV Genotyping Test (LA).
Two hundred and one cervicovaginal swabs were collected from March to July, 2019 in Hong Kong Sanatorium & Hospital. The swabs were preserved in SurePath preservative fluid (Becton, Dickson and Company, Sparks, MD, USA) and routinely tested for Papanicolaou smear (Pap smear, following The Bethesda System for reporting), cobas HPV Test and LA (Roche Diagnostics, Mannheim, Germany). Routine test results are shown in Table 1.
DNA extraction and cobas HPV Test were performed using cobas 4800 system (Roche Diagnostics, Rotkreuz, Switzerland). Briefly, 500 μL of cervicovaginal specimen was added to 500 μL of sample preparation buffer and heated at 120 °C for 20 min. The mixture was brought to ambient temperature for 10 min and processed on cobas × 480 using ‘high-risk HPV DNA PCR’ protocol. Real-time polymerase chain reaction (PCR) was performed on cobas z 480. Fifty microliter of DNA extract was used for LA according to manufacturer’s recommendations. Residual DNA was used for Nanopore protocol after routine testing.
For each specimen, L1 region of HPV genome was amplified in 2 separate PCRs using PGMY and MGP primer sets [13, 14]. Primer sequences and cycling conditions are shown in Tables 2 and 3. Human β-globin gene was used as inhibition control and contamination was monitored by negative extraction control. Five microliter of each PCR amplicon was electrophoresized in 2% agarose gel (Invitrogen, Carlsbad, CA, USA) and analyzed. PCR-positive specimens were sequenced using Nanopore MinION.
Nanopore sequencing library preparation
PGMY and MGP PCR amplicons of each positive specimen were pooled and purified using AMPure XP beads (Beckman-Coulter, Brea, CA, USA). Nanopore sequencing libraries were prepared from purified amplicons using Ligation Sequencing Kit 1D (SQK-LSK109) and PCR-free Native Barcoding Expansion Kit (EXP-NBD104/114) (Oxford Nanopore Technologies, Oxford, England). The barcoded libraries were loaded and sequenced on MinION flow cells (FLO-MIN106D R9.4.1, Oxford Nanopore Technologies, Oxford, England) after quality control runs.
Data from first 2 h of sequencing runs was analyzed. FASTQ files generated by live basecalling (MinKNOW version 2.0) were demultiplexed using ‘FASTQ Barcoding’ workflow on EPI2ME (Oxford Nanopore Technologies, Oxford, England) with default minimum qscore of 7, ‘auto’ and ‘split by barcode’ options. FASTQ files of each specimen were concatenated into a single file and analyzed using a 2-step custom workflow on Galaxy bioinformatics platform. Briefly, FASTQ files were converted into FASTA format, followed by aligning sequences against HPV reference genomes from PaVE database using NCBI BLAST+ blastn (Galaxy version 1.1.1). PGMY and MGP reads were sorted based on sequence length and analyzed individually. Threshold of each run was derived from average number of background reads plus 10 standard deviations, which were calculated using interquartile rule, excluding first and last quartiles. A positive HPV call was based on either (1) the number of reads for a particular HPV type was above threshold, or (2) the specimen had the highest number of reads for a particular HPV type. All positive calls were further assessed by aligning FASTQ sequences against HPV reference genomes using minimap2 (Galaxy version 2.17 + galaxy0), and consensus sequences were built from BAM files using Unipro UGENE (version 1.29.0) for determining their percentage of identity to reference genomes.
As HPV 66 is categorized as ‘other high-risk’ by cobas HPV Test, all calculations were based on this grouping, albeit HPV 66 was found as a single infection in cancers with extreme rarity and re-classified as possible carcinogen (Group 2B) by IARC Monographs Working Group .
The results are summarized in Table 1. PCR was successful for 191 specimens (191/201, 95.02%), with 10 specimens (10/201, 4.98%) lacking β-globin band and therefore regarded as inappropriate for further analysis. Seventy-six specimens (76/201, 37.81%) were negative for both PGMY and MGP PCRs, and 115 (115/201, 57.21%) were positive for either of the two. PCR-positive specimens were sequenced on 10 MinION flow cells with 145–890 active pores, generating 31,748–525,880 HPV reads in first 2 h (Table 4). For the 115 specimens sequenced, 19 were negative (7–522 reads, 113 in average) and 96 were positive (45–96,549 reads, 20,158 in average) for HPV. Taken together, there were 95 HPV-negative (95/201, 47.26%) and 96 HPV-positive (96/201, 47.76%) specimens by Nanopore workflow.
Table 5 shows concordance of Nanopore workflow with cobas HPV Test and LA, which was based on the 37 HPV types detectable by LA. For cobas HPV Test, our workflow achieved 93.19, 93.19 and 81.94% for perfect, total and positive agreement, respectively, with Cohen’s kappa of 0.85. For LA, Nanopore achieved a perfect agreement of 83.77% for both high-risk and non-high risk HPVs. For high-risk types, total and positive agreement were 96.86 and 91.78%, respectively, with Cohen’s kappa of 0.93. For non-high risk types, total and positive agreement were 93.19 and 77.59%, respectively, with Cohen’s kappa of 0.83.
Table 6 shows per-type concordance of Nanopore and LA. A total of 13 high-risk and 19 non-high risk HPV types were evaluated. Positive agreement for HPV 16 (n = 8) and 18 (n = 1) were 87.5 and 100%, respectively. Positive agreement was 75–100% for high-risk HPV 31, 33, 35, 39, 51, 52, 56, 58, 59 and 66, and 20% for HPV 68 (n = 5). For non-high risk HPVs, positive agreement was 37.5–100% for HPV 6, 11, 40, 42, 53, 54, 55, 61, 62, 70, 72, 73, 81, 82, 83, 84 and 89. There were 2 non-high risk types with 0% positive agreement (HPV 26 and 71). HPV 26 (n = 1) was only detected by Nanopore workflow, whereas HPV 71 (n = 2) was only detected by LA.
Table 7 reveals the percentage of identity of Nanopore consensus sequences to HPV reference genomes. In general, Nanopore consensus sequences showed an average identity of 98% to the best matches, with an average difference of 15% from second BLAST hits.
Table 8 summarizes HPV status of each cytology grading. For high-grade and low-grade squamous intraepithelial lesion (HSIL and LSIL), nearly all specimens were positive for high-risk HPV (HSIL: 4/4, 100%; LSIL: 16/18, 88.89%). For atypical squamous/ glandular cells, about half of the specimens were positive for high-risk HPV (by LA: 19/41, 46.34%; by Nanopore: 18/41, 43.90%). For cases without observable abnormalities, 22.12% (25/113) and 21.24% (24/113) were positive for high-risk HPV by LA and Nanopore, respectively.
Hong Kong has been one of the Asian regions with the lowest incidence and mortality rate of CC . This might be attributable to the territory-wide cervical screening program implemented by Department of Health since 2004. The program is well-organized, which involves public education, regular cervical smear and follow-up service for eligible women, and a quality assurance mechanism on key components of the program . Cytology is the mainstay of primary screening, and high-risk HPV testing may be performed for triage to colposcopy.
Cytology and HPV testing have their own value for CC screening. High quality cytology has high specificity for CC, but with lower sensitivity ranging from 50% suggested by cross-sectional studies to 75% estimated longitudinally . For HPV testing, the sensitivity was reported to be about 10% higher than cytology, yet with lower specificity . Complementary use of both tests could enhance the sensitivity approaching 100% with high specificity (92.5%) . In fact, this combined approach has been adopted by several European countries and may become the future trend of primary CC screening in developed countries.
Compared with HPV assays in the market, HPV genotyping by NGS offers a broader detection spectrum which, despite minimal benefit of non-high risk HPV information for CC screening, may provide important etiologic clues for other HPV-associated infections and a more complete picture of HPV epidemiology. For the latter, Nanopore identified more HPV types per sample (Fig. 1) and 5 extra HPV types (HPV 43, 44, 74, 87 and 90, n = 34) not detectable by LA (Fig. 2), with an unexpected high incidence of HPV 90 (n = 12) which was reported in North America and Belgium but not in Hong Kong [20, 21]. Another advantage offered by NGS is its potential utility for simultaneous characterization of cervicovaginal microbiome, with its possible role in dysplasia and carcinogenesis revealed by accumulating research evidence [22,23,24,25]. These merits may facilitate a multifaceted approach for evaluation of woman health in near feature.
In general, Nanopore had substantial agreement with cobas HPV Test and LA. Compared with cobas HPV Test, Nanopore appeared to be more sensitive for HPV 52 (n = 7) and 59 (n = 4), with 81.82% (9/11) of these discrepant results matched with LA. Compared with LA, concordance for high-risk HPV was higher than non-high risk types. Among the 37 discrepant results, 22 were false negatives by Nanopore and 15 were not detected by LA.
For the false negatives by Nanopore, more than half (12/22, 54.55%) were mixed infections, and similar finding was reported by other research groups using HPV consensus primers for NGS-based genotyping [10, 11]. Other possible causes of false negatives included (1) low viral load, as evident by Specimen 182, from which HPV 16 was missed by both Nanopore and cobas HPV Test; (2) substantial difference in DNA input (50 μL for LA versus 5 μL for PGMY/ MGP PCR), as well as (3) lower sensitivity due to reduced magnesium chloride concentration of PGMY PCR (from 4 mM to 1.5 mM), which was fine-tuned for minimal non-specific amplification.
For the 15 HPV types missed by LA, the average identity of Nanopore consensus sequences was 98.27% with an average difference of 16% from second BLAST hits (Table 7). As distinct HPV types generally have more than 10% difference in L1 sequence [26, 27], it appeared that the discrepant positive calls were less likely caused by high sequencing error rate of Nanopore. More specifically, 5 of these positive calls were identified solely by MGP PCR (5/15, 33.33%), 5 detected by PGMY PCR only (5/15, 33.33%), and 5 by both PCRs (5/15, 33.33%). These revealed differential sensitivities of PGMY and MGP PCR primers, which might complement with each other and enhance overall performance of the Nanopore assay. On the other hand, Nanopore sequencing might improve the resolution of genotyping, which might not be attained by line blot method due to cross-hybridization of certain probes. For instance, Nanopore identified HPV 52 in Specimen 5, 40 and 74, which could not be confirmed by LA due to cross-hybridization with HPV 33 and 58, respectively. Another example was Specimen 125, which was HPV 84-positive by LA and HPV 87-positive by Nanopore. From literature, Artaza-Irigaray and colleagues reported cross-hybridization between these 2 HPV types by LA, with 11.5% of HPV 84-positive cervical specimens by LA were actually HPV 87-positive by NGS .
The Nanopore method and LA revealed very similar high-risk HPV positivity in each cytology grading. The goal of combined cytology-HPV testing approach is to enhance cost effectiveness of CC screening. While minimizing unnecessary referral for colposcopy, HPV genotyping may identify high-risk individuals before observable cytological abnormalities, for instance, the 4 HPV 16-positive patients without abnormal cytology findings in this study. This may facilitate an early detection approach for cancer prevention.
Our study had several limitations. First, the sample size of certain HPV types, for example, HPV 18 (n = 1), was less satisfactory for evaluating type-specific performance. Second, as residual DNA was used after routine testing, DNA input for PGMY and MGP PCRs was constrained which might lower the sensitivity. In addition, as flow cells with suboptimal number of active pores were used, sequencing time and depth might be further improved if new flow cells were used.
We developed a Nanopore workflow for HPV genotyping, with performance comparable to or better than 2 reference methods in the market. Our method was economical, with a reagent cost of about USD 50.77 per patient specimen for 24-plex runs, which was competitive when compared to an average price of USD 106.14 (from 4 randomly-selected laboratories) for HPV genotyping referral service in our region (Table 9). The protocol was also straightforward with reasonable turnaround time of about 12 h from samples to answers. The small size and portability of MinION sequencers may well suit remote or resource-limited laboratories with constraints in space. Future prospective study with larger sample size is warranted to further evaluate test performance and streamline the protocol. As LA was discontinued in Hong Kong, the Nanopore workflow described here may provide an economical option for broad-range HPV genotyping.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
High-grade squamous intraepithelial lesion
International Agency for Research on Cancer (IARC)
Roche Linear Array HPV Genotyping Test
Low-grade squamous intraepithelial lesion
- Pap smear:
Polymerase chain reaction
World Health Organization
zur Hausen H. Papillomaviruses in the causation of human cancers – a brief historical account. Virology. 2009;384:260–5.
Meisels A, Fortin R. Condylomatous lesions of the cervix and vagina. I Cytologic patterns Acta Cytol. 1976;20:505–9.
Purola E, Savia E. Cytology of gynecologic condyloma acuminatum. Acta Cytol. 1977;21:26–31.
Muñoz N, Bosch FX, de Sanjosé S, Herrero R, Castellsagué X, Shah KV, et al. Epidemiologic classification of human papillomavirus types associated with cervical cancer. N Engl J Med. 2003;348:518–27.
IARC Working Group on the Evaluation of Carcinogenic Risks to Humans. Human papillomaviruses. IARC Monogr Eval Carinog Risks Hum. 2007;90:1–636.
Schiffman M, Clifford G, Buonaguro FM. Classification of weakly carcinogenic human papillomavirus types: addressing the limits of epidemiology at the borderline. Infect Agent Cancer. 2009;4:8.
Chrysostomou AC, Stylianou DC, Constantinidou A, Kostrikis LG. Cervical cancer screening programs in Europe: the transition towards HPV vaccination and population-based HPV testing. Viruses. 2018;10:729.
Petry KU, Barth C, Wasem J, Neumann A. A model to evaluate the costs and clinical effectiveness of human papilloma virus screening compared with annual Papanicolaou cytology in Germany. Eur J Obstet Gynecol Reprod Biol. 2017;212:132–9.
Pan American Health Organization. Section 2: Summary of commercially available HPV tests. https://www.paho.org/hq/dmdocuments/2016/manual-VPH-English-02.pdf (2016). Accessed 3 Jan 2020.
Nilyanimit P, Chansaenroj J, Poomipak W, Praianantathavorn K, Payungporn S, Poovorawan Y. Comparison of four human papillomavirus genotyping methods: next-generation sequencing, INNO-LiPA, electrochemical DNA Chip, and nested-PCR. Ann Lab Med. 2018;38:139–46.
Nowak RG, Ambulos NP, Schumaker LM, Mathias TJ, White RA, Troyer J, et al. Genotyping of high-risk anal human papillomavirus (HPV): ion torrent-next generation sequencing vs. linear array. Virol J. 2017;14:112.
Wagner S, Roberson D, Boland J, Yeager M, Cullen M, Mirabello L, et al. Development of the TypeSeq assay for detection of 51 human papillomavirus genotypes by next-generation sequencing. J Clin Microbiol. 2019;57:e01794–18.
Gravitt PE, Peyton CL, Alessi TQ, Wheeler CM, Coutlée F, Hildesheim A, et al. Improved amplification of genital human papillomaviruses. J Clin Microbiol. 2000;38:357–61.
Söderlund-Strand A, Carlson J, Dillner J. Modified general primer PCR system for sensitive detection of multiple types of oncogenic human papillomavirus. J Clin Microbiol. 2009;47:541–6.
Marín M, Garcia-Lechuz JM, Alonso P, Villanueva M, Alcalá L, Gimeno M, et al. Role of universal 16S rRNA gene PCR and sequencing in diagnosis of prosthetic joint infection. J Clin Microbiol. 2012;50:583–9.
Department of Health, the Government of the Hong Kong Special Administrative Region. Evidence for organized screening programme. https://www.cervicalscreening.gov.hk/english/about/abt_evidence.html (2013). Accessed 15 Jan 2020.
Department of Health, the Government of the Hong Kong Special Administrative Region. About Cervical Screening Programme. https://www.cervicalscreening.gov.hk/english/about/about.html (2013). Accessed 15 Jan 2020.
World Health Organization: Cervical cancer screening in developing countries. Report of a WHO consultation. https://apps.who.int/iris/bitstream/handle/10665/42544/9241545720.pdf;jsessionid=2599A27FFB141B755D015B645FB889D9?sequence=1 (2002). Accessed 15 Jan 2020.
Mayrand MH, Duarte-Franco E, Rodrigues I, Walter SD, Hanley J, Ferenczy A, et al. Human papillomavirus DNA versus Papanicolaou screening tests for cervical cancer. N Engl J Med. 2007;357:1579–88.
Quiroga-Garza G, Zhou H, Mody DR, Schwartz MR, Ge Y. Unexpected high prevalence of HPV 90 infection in an underserved population: is it really a low-risk genotype? Arch Pathol Lab Med. 2013;137:1569–73.
Schmitt M, Depuydt C, Benoy I, Bogers J, Antoine J, Arbyn M, et al. Prevalence and viral load of 51 genital human papillomavirus types and three subtypes. Int J Cancer. 2013;132:2395–403.
Mitra A, Maclntyre DA, Lee YS, Smith A, Marchesi JR, Lehne B, et al. Cervical intraepithelial neoplasia disease progression is associated with increased vaginal microbiome diversity. Sci Rep. 2015;5:16865.
Oh HY, Kim BS, Seo SS, Kong JS, Lee JK, Park SY, et al. The association of uterine cervical microbiota with an increased risk for cervical intraepithelial neoplasia in Korea. Clin Microbiol Infect. 2015;21:674 e1–9.
Klein C, Gonzalez D, Samwel K, Kahesa C, Mwaiselage J, Aluthge N, et al. Relationship between the cervical microbiome, HIV status, and precancerous lesions. MBio. 2019;10:e02785–18.
Nené NR, Reisel D, Leimbach A, Franchi D, Jones A, Evans I, et al. Association between the cervicovaginal microbiome, BRCA1 mutation status, and risk of ovarian cancer: a case-control study. Lancet Oncol. 2019;20:1171–82.
King AJ, Sonsma JA, Vriend HJ, van der Sande MA, Feltkamp MC, Boot HJ, et al. Genetic diversity in the major capsid L1 protein of HPV-16 and HPV-18 in the Netherlands. PLoS One. 2016;11:e0152782.
Burk RD, Harari A, Chen Z. Human papillomavirus genome variants. Virology. 2013;445:232–43.
Artaza-Irigaray C, Flores-Miramontes MG, Olszewski D, Vallejo-Ruiz V, Limón-Toledo LP, Sánchez-Roque C, et al. Cross-hybridization between HPV genotypes in the linear Array genotyping test confirmed by next-generation sequencing. Diagn Pathol. 2019;14:31.
We thank the colleagues of Department of Pathology, Hong Kong Sanatorium & Hospital for their dedicated and professional work on routine laboratory diagnostics.
Ethics approval and consent to participate
This study was approved by Research Ethics Committee (REC) of Hong Kong Sanatorium & Hospital under the reference number RC-2019-18. No patient-identifying data was collected throughout the whole study.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Chan, W.S., Chan, T.L., Au, C.H. et al. An economical Nanopore sequencing assay for human papillomavirus (HPV) genotyping. Diagn Pathol 15, 45 (2020). https://doi.org/10.1186/s13000-020-00964-6