\(\renewcommand\AA{\unicode{x212B}}\)
Extract and Manipulate Data: Examples¶
Read vs Extract¶
Read produces a view into the chosen part of the original data.
Extract creates a copy of this part of the data.
Read¶
Using a loop, read and print the first value in all spectra
for index in range(0, raw_workspace.getNumberHistograms()):
#Note the round brackets followed by the square brackets
print(raw_workspace.readY(index)[0])
Workspace data can be read as numpy arrays, spectrum by spectrum:
ws = Load(Filename="HRP39182.RAW")
for i in range(ws.getNumberHistograms()):
y = ws.readY(i)
x = ws.readX(i)
e = ws.readE(i)
Warning¶
Be careful: the outputs of read (y,x,e) are only views into the data held by the workspace, ws. If ws is deleted, the contents of x,y,e will be invalid (the random contents of the memory locations formerly used for ws). If you need x,y,e data to persist longer than the workspace, use the extract methods, which create a copy of the data in ws into y,x,e.
Extract¶
The data from all spectra can be obtained as a mutable multi-dimensional array in one-call using the extract methods.
ws = Load(Filename="HRP39182.RAW")
x = ws.extractX()
y = ws.extractY()
e = ws.extractE()
print(x.shape)
print(y.shape)
print(e.shape)
Since the extract methods return multi-dimensional numpy arrays. So to use extract in a similar way to read, you need to slice these arrays with indexing.
E.g. instead of ws.readX(5) you should use:
ws.extractX()[5, :]
xmat = ws.extractX(); x = xmat[5, :]
Nested Looping¶
This allows access to the individual bins in each spectrum. E.g. to sum the y-values in each spectrum:
ws = Load(Filename="HRP39182.RAW")
ws = Rebin(InputWorkspace=ws, Params=1e4) # Rebin to make the looping more manageable.
# Outer loop. Loop over spectrum
for i in range(ws.getNumberHistograms()):
y = ws.readY(i)
sum_counts = 0
# Inner loop. Loop over bins.
for j in range(ws.blocksize()):
sum_counts += y[j]
# Display spectrum number against sum_counts
print("Spectrum Number: {0}, Total Counts: {1}".format(ws.getSpectrum(i).getSpectrumNo(), sum_counts))
Creating Output Workspaces¶
We may perform some processing on the data arrays before creating our new workspace.
Creating a MatrixWorkspace¶
Use CreateWorkspace v1, with the correct input arrays.
E.g. Change the x-axis for TOF from microseconds to milliseconds:
Enable :plots: using DOCS_PLOTDIRECTIVE in CMake
Creating a TableWorkspace¶
Use CreateEmptyTableWorkspace v1 and addColumn() and addRow() as needed. Refer back to TableWorkspace with Python
E.g. To read out the value in the first bin for each spectrum:
ws = Load(Filename="GEM40979.RAW")
table = CreateEmptyTableWorkspace()
table.addColumn('int', 'Spectrum Number')
table.addColumn('double', 'First Bin Value')
for i in range(ws.getNumberHistograms()):
specNumber = ws.getSpectrum(i).getSpectrumNo()
# read each spectrum, just the first bin
y = ws.readY(i)[0]
table.addRow([specNumber,y])