PDA Journal of Pharmaceutical Science and Technology
Research Article

Systematic Design, Generation, and Application of Synthetic Datasets for Flow Cytometry

Melissa Cheung, Jonathan J. Campbell, Robert J. Thomas, Julian Braybrook and Jon Petzing
PDA Journal of Pharmaceutical Science and Technology May 2022, 76 (3) 200-215; DOI: https://doi.org/10.5731/pdajpst.2021.012659
Melissa Cheung
Centre for Biological Engineering, Loughborough University, Loughborough, Leicestershire, United Kingdom
  • For correspondence: M.Cheung@lboro.ac.uk
Jonathan J. Campbell
National Measurement Laboratory, LGC, Teddington, United Kingdom
Robert J. Thomas
Centre for Biological Engineering, Loughborough University, Loughborough, Leicestershire, United Kingdom
Julian Braybrook
National Measurement Laboratory, LGC, Teddington, United Kingdom
Jon Petzing
Centre for Biological Engineering, Loughborough University, Loughborough, Leicestershire, United Kingdom

Abstract

Application of synthetic datasets in the training and validation of analysis tools has led to improvements in many decision-making tasks in domains ranging from computer vision to digital pathology. Synthetic datasets overcome the constraints of real-world datasets, namely difficulties in collection and labeling, expense, time, and privacy concerns. In flow cytometry, real cell-based datasets are limited by properties such as size, number of parameters, distance between cell populations, and distributions, and are often focused on a narrow range of disease or cell types. Researchers have in some cases designed these desired properties into synthetic datasets; however, operators have implemented them inconsistently, and there is a scarcity of publicly available, high-quality synthetic datasets. In this research, we propose a method to systematically design and generate flow cytometry synthetic datasets with highly controlled characteristics. We demonstrate the generation of two-cluster synthetic datasets with specific degrees of separation between cell populations, and of non-normal distributions with increasing levels of skewness and different orientations of skew pairs. We apply our synthetic datasets to test the performance of a popular automated cell population identification software, SPADE3, and define the region where the software performance decreases as the clusters get closer together. Application of the synthetic skewed dataset suggests the software is capable of processing non-normal data. We calculate the classification accuracy of SPADE3 with robustness not achievable with real-world datasets. Our approach aims to advance research toward generation of high-quality synthetic flow cytometry datasets and to raise awareness of them in the community. The synthetic datasets can be used in benchmarking studies that critically evaluate cell population identification tools and help illustrate potential digital platform inconsistencies. These datasets have the potential to improve cell characterization workflows that integrate automated analysis in clinical diagnostics and cell therapy manufacturing.
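
The idea of a two-cluster synthetic dataset with controlled separation and skew can be illustrated with a minimal sketch. This is not the authors' generation method, which the abstract does not specify. It assumes that "separation" means the distance between cluster means along one parameter, in units of the cluster standard deviation, and it uses the skew-normal family as one possible source of non-normal data. All function names and parameters (`skew_normal`, `two_cluster_dataset`, `threshold_accuracy`, `alpha`) are hypothetical, and the midpoint threshold is only a stand-in classifier, not SPADE3.

```python
import numpy as np


def skew_normal(rng, size, alpha=0.0):
    """Draw standard skew-normal samples; alpha is the shape (0 gives a normal)."""
    delta = alpha / np.sqrt(1.0 + alpha**2)
    u0 = rng.standard_normal(size)
    u1 = rng.standard_normal(size)
    # Standard stochastic representation of the skew-normal distribution.
    return delta * np.abs(u0) + np.sqrt(1.0 - delta**2) * u1


def two_cluster_dataset(n_per_cluster=5000, separation=4.0,
                        alpha=(0.0, 0.0), seed=0):
    """Two labeled 2-parameter clusters (assumed layout, for illustration).

    separation: distance between cluster means on parameter 0, in SD units.
    alpha: skew shape of each cluster on parameter 0; opposite signs give
           skew pairs whose tails point toward or away from each other.
    """
    rng = np.random.default_rng(seed)
    clusters = []
    for k in range(2):
        pts = np.column_stack([
            skew_normal(rng, n_per_cluster, alpha[k]),
            skew_normal(rng, n_per_cluster, 0.0),
        ])
        # Standardise so that skew changes the shape, not the location or spread.
        pts = (pts - pts.mean(axis=0)) / pts.std(axis=0)
        pts[:, 0] += k * separation
        clusters.append(pts)
    events = np.vstack(clusters)
    labels = np.repeat([0, 1], n_per_cluster)
    return events, labels


def threshold_accuracy(events, labels, separation):
    """Accuracy of a midpoint threshold on parameter 0 (stand-in classifier)."""
    predicted = (events[:, 0] > separation / 2.0).astype(int)
    return float(np.mean(predicted == labels))


if __name__ == "__main__":
    for sep in (6.0, 3.0, 1.0):
        events, labels = two_cluster_dataset(separation=sep)
        print(sep, threshold_accuracy(events, labels, sep))
```

Because every event carries a known label, the accuracy of any population identification tool can be computed directly against ground truth, and sweeping `separation` downward shows where performance starts to fall as the clusters approach each other.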

  • Flow cytometry
  • Synthetic datasets
  • Clusters
  • Separation
  • Skew
  • Accuracy
  • Repeatability
© PDA, Inc. 2022

© 2025 PDA Journal of Pharmaceutical Science and Technology Print ISSN: 1079-7440  Digital ISSN: 1948-2124
