Record Details

2D Acoustic Numerical Breast Phantoms and USCT Measurement Data

Harvard Dataverse (Africa Rice Center, Bioversity International, CCAFS, CIAT, IFPRI, IRRI and WorldFish)

 
 
Field Value
 
Title 2D Acoustic Numerical Breast Phantoms and USCT Measurement Data
 
Identifier https://doi.org/10.7910/DVN/CUFVKE
 
Creator Li, Fu
Villa, Umberto
Park, Seonyeong
Anastasio, Mark
 
Publisher Harvard Dataverse
 
Description Companion dataset of the manuscript:


Fu Li, Umberto Villa, Seonyeong Park, Mark A. Anastasio. Three-dimensional stochastic numerical breast phantoms for enabling virtual imaging trials of ultrasound computed tomography. arXiv preprint arXiv:2106.02744 (2021).



This dataset includes a collection of 52 two-dimensional slices of numerical breast phantoms (NBPs) and corresponding ultrasound computed tomography (USCT) simulated measurement data. The anatomical structures of these NBPs were obtained by use of tools from the Virtual Imaging Clinical Trial for Regulatory Evaluation (VICTRE) project. More details on the modification and extension of the VICTRE NBPs for use in USCT studies can be found in the accompanying paper.


The NBPs included in this dataset are representative of four ACR BI-RADS breast composition types:

  • A. The breast is almost entirely fatty

  • B. There are scattered areas of fibroglandular density

  • C. The breast is heterogeneously dense

  • D. The breast is extremely dense



Each NBP contains 2D maps of tissue labels, speed of sound, acoustic attenuation, and density. A low-resolution speed-of-sound map is also provided so that the full-waveform inversion (FWI) reconstruction results presented in the accompanying paper can be reproduced.


The corresponding USCT measurement data were simulated by modeling 2D wave propagation in lossy heterogeneous media using a time-explicit pseudospectral wave propagation solver.


The dataset consists of three folders:

  • The 2d_slices folder contains the 52 slices extracted from 3D NBPs.

  • The measurements folder contains simulated measurement data corresponding to each slice.

  • The imaging_system folder contains information about the 2D imaging system (excitation source, transducer coordinates).



In addition, the following helper MATLAB scripts are included:

  • read_data.m: Helper function to load and visualize the excitation source and transducer locations.

  • read_waveform_data.m: Helper function to read the .h5 files containing the measurement data.



Each slice is saved as a binary MATLAB file (.mat) and contains the following variables:

  • label: tissue label map with [2560,2560] pixels and 0.1 mm pixel size. Tissue types are denoted using the following labels: water (0), fat (1), skin (2), glandular (29), ligament (88), lesion (200).

  • sos: speed of sound map (mm/μs) with [2560,2560] pixels and 0.1 mm pixel size. Data is stored as data type float32.

  • aa: acoustic attenuation map (Np/m/MHz^y, where y is the attenuation exponent) with [2560,2560] pixels and 0.1 mm pixel size. Data is stored as data type float32.

  • density: density map (kg/mm^3) with [2560,2560] pixels and 0.1 mm pixel size. Data is stored as data type float32.

  • sos_ini: low-resolution speed of sound map (mm/μs) with [1280,1280] pixels and 0.2 mm pixel size. Data is stored as data type float32. This is the initial guess used in the speed of sound reconstructions in our paper.

  • y: attenuation exponent used for simulation.

  • seed: phantom ID.

  • type: breast composition type (A-D).
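
As a rough illustration of how the variables listed above might be interpreted once a slice has been loaded, the following Python sketch maps the documented label values to tissue names and converts the stored units to SI. Only the label codes, units, and grid sizes come from the description above; the function names are hypothetical and are not part of the dataset or its helper scripts.

```python
from collections import Counter

# Label codes as documented for the `label` variable.
TISSUE_LABELS = {
    0: "water",
    1: "fat",
    2: "skin",
    29: "glandular",
    88: "ligament",
    200: "lesion",
}


def tissue_counts(label_map):
    """Count pixels per tissue name in a 2D label map (rows of integer codes).

    Codes that are not in the documented list are reported as "unknown".
    """
    counts = Counter()
    for row in label_map:
        for code in row:
            counts[TISSUE_LABELS.get(int(code), "unknown")] += 1
    return dict(counts)


def sos_to_m_per_s(sos_mm_per_us):
    """Convert speed of sound from mm/us to m/s (1 mm/us = 1000 m/s)."""
    return sos_mm_per_us * 1000.0


def density_to_kg_per_m3(density_kg_per_mm3):
    """Convert density from kg/mm^3 to kg/m^3 (1 m^3 = 1e9 mm^3)."""
    return density_kg_per_mm3 * 1e9


def field_of_view_mm(n_pixels, pixel_size_mm):
    """Side length in mm of a square grid with the given pixel count and size."""
    return n_pixels * pixel_size_mm
```

With these helpers, field_of_view_mm(2560, 0.1) and field_of_view_mm(1280, 0.2) both give a side length of about 256 mm, which suggests that sos_ini covers the same physical extent as the full-resolution maps at half the sampling density.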



The simulated measurement data are saved in HDF5 format. Measurement data corresponding to the i-th emitting transducer are stored under the HDF5 key equal to the transducer index, as a two-dimensional array of size [1024,4250]. The rows correspond to the receiver index and the columns to the time sample. The sampling frequency is 25 MHz. Because of file size limitations, the measurement data for each slice have been divided into 8 chunks, each containing data from 128 receivers.
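
The layout described above implies some simple index bookkeeping, sketched below in Python. The sketch assumes 0-based receiver indices, that the 8 chunks hold consecutive blocks of 128 receivers in order, and that sample 0 corresponds to t = 0. None of these details is stated in this description, so they should be checked against read_waveform_data.m before being relied on. The helper names are hypothetical.

```python
N_RECEIVERS = 1024        # rows of each per-emitter array
N_TIME_SAMPLES = 4250     # columns of each per-emitter array
SAMPLING_FREQ_MHZ = 25.0  # sampling frequency from the description
N_CHUNKS = 8
RECEIVERS_PER_CHUNK = N_RECEIVERS // N_CHUNKS  # 128


def locate_receiver(receiver_index):
    """Return (chunk, row_in_chunk) for a 0-based receiver index.

    Assumes each chunk stores a consecutive block of receivers (an assumption).
    """
    if not 0 <= receiver_index < N_RECEIVERS:
        raise IndexError("receiver index out of range")
    return divmod(receiver_index, RECEIVERS_PER_CHUNK)


def sample_time_us(sample_index):
    """Time in microseconds of a 0-based sample index, taking sample 0 as t = 0."""
    return sample_index / SAMPLING_FREQ_MHZ


def record_length_us():
    """Total recorded duration in microseconds (4250 samples at 25 MHz)."""
    return N_TIME_SAMPLES / SAMPLING_FREQ_MHZ
```

Under these assumptions each trace spans 170 μs, and receiver 128 would be the first row of the second chunk.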

The imaging_system folder contains information regarding the 2D imaging system.

  • source300.mat describes the time profile of the excitation pulse. It consists of 300 time samples at a sampling frequency of 25 MHz.

  • locations1024.mat provides the xy coordinates (mm) of the location of each transducer. Data type is float32. Array size is [2x1024].
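
The transducer coordinates and the sampling frequency can be combined for a quick sanity check on the measurement data, for example by estimating when a pulse should first arrive at a receiver. The Python sketch below does this for a straight path through a homogeneous medium. It assumes a nominal water speed of sound of 1.5 mm/μs, which is not taken from the dataset, and it represents the [2x1024] array as two lists of x and y coordinates. The function names are hypothetical, and the estimate ignores the shape of the excitation pulse.

```python
import math

SAMPLING_FREQ_MHZ = 25.0   # from the dataset description
SOURCE_SAMPLES = 300       # length of the excitation pulse in samples
WATER_SOS_MM_PER_US = 1.5  # assumed nominal value, not taken from the dataset


def pulse_duration_us():
    """Duration of the excitation pulse in microseconds."""
    return SOURCE_SAMPLES / SAMPLING_FREQ_MHZ


def distance_mm(locations, i, j):
    """Distance in mm between transducers i and j; locations = [xs, ys] (2 x N)."""
    xs, ys = locations
    return math.hypot(xs[i] - xs[j], ys[i] - ys[j])


def arrival_sample(locations, emitter, receiver, sos=WATER_SOS_MM_PER_US):
    """Approximate sample index of the first arrival along a straight path.

    Assumes a homogeneous medium with speed of sound `sos` in mm/us.
    """
    travel_time_us = distance_mm(locations, emitter, receiver) / sos
    return round(travel_time_us * SAMPLING_FREQ_MHZ)
```

For example, with made-up coordinates [[0.0, 30.0, 0.0], [0.0, 0.0, 40.0]], transducers 1 and 2 are 50 mm apart, so the first arrival would be expected near sample 833 in water.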



Warning: This is a very large dataset (~1 TB). Please see our download script written in Python.
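
The quoted size is consistent with a back-of-the-envelope estimate, sketched below in Python. It assumes that each of the 1024 transducers acts as an emitter once per slice and that the measurement samples are stored uncompressed as 4-byte float32 values. The description does not state the data type of the measurement data, so both points are assumptions.

```python
N_SLICES = 52
N_EMITTERS = 1024      # assumed: every transducer emits once per slice
N_RECEIVERS = 1024
N_TIME_SAMPLES = 4250
BYTES_PER_SAMPLE = 4   # assumed: float32, uncompressed


def bytes_per_slice():
    """Estimated measurement data size for one slice, in bytes."""
    return N_EMITTERS * N_RECEIVERS * N_TIME_SAMPLES * BYTES_PER_SAMPLE


def total_measurement_bytes():
    """Estimated measurement data size for all 52 slices, in bytes."""
    return N_SLICES * bytes_per_slice()
```

Under these assumptions the measurement data come to roughly 17.8 GB per slice and about 0.93 TB in total, not counting the phantom maps themselves.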
 
Subject Medicine, Health and Life Sciences
Other
Medical imaging
Ultrasound computed tomography
Numerical phantom
 
Contributor Villa, Umberto