![]() |
Data Quality Assessment |
Quality control of Gemini data is achieved by performing data quality assessment (QA). The QA processes are illustrated in the following flowchart and described below in more detail:
Quality Assesement, in this form at least, is applicable only to queue (and QuickStart service mode) data.
Observations awaiting execution are stored in the Observing Database. The database (not in its final form during early science use) will contain ingested Phase I proposals and Phase II details defined by the PI. The database also contains the status of an observation, including its location within the QA process.
Execution is carried out at the observation level not the science program level. That is, flexibility to respond to changing conditions, for example, is provided by switching between programs. Thus an observation is the minimum schedulable entity. Each observation may generate many datasets (when executed by the OCS, one OBSERVE command generates one dataset). Each dataset is a single FITS file. For example, multiple repetitions of a mosaic or offsets between object and sky form one observation.
Real-time quality assessment is performed by the observer taking the data (usually at the summit). This is intended to be a very simple level of assessment and is applied to all datasets. There are two parts: (a) identify whether or not the dataset meets the observing condition criteria defined by the PI and (b) test against simple Gemini criteria to determine whether the data is usable for doing science.
For all queue observations, the PI will have specified four observing conditions: image quality, cloud cover, water vapor content and sky background. The conditions are defined to be within one of several percentile bins. By comparing the requested and current conditions the QA identifies whether the PI criteria are met. Therefore the process adds four header keywords to each dataset (RAW(IQ|CC|WV|BG) with the values being the current observing condition bin (e.g. 20-percentile). Another header keyword identifies whether the overall PI requirements are met (RAWPIREQ, with values of yes|no|unknown). The "unknown" value is required because it may not be possible to determined the image quality without significant data processing.
The criteria of scientific utility is whether or not the dataset contains useful scientific information as determined by simple visual inspection. To pass this test it is not necessary that the dataset be usable in the program for which it was taken (which is why it is denoted as a Gemini, not PI, criterion). The intent is to ensure that all usable data enters the science archive. As examples, data taken in which the shutter failed to open or the dome caused severe vignetting would be classified as "bad". An exposure in which only three of the four detector quadrants returned useable data or in which the target was saturated might be serviceable for another program. This process adds one header item to the dataset: RAWGEMQA with a value of (bad|usable|unknown). If the dataset is bad then the observation is re-scheduled in the queue. (NB: it need not be re-observed immediately depending on the reason for bad data).
The observation status in the database is updated after QA.
Off-line data processing is to be performed in an integrated data processing pipeline. Until that is available, off-line data processing is performed semi-manually at sea level by the Gemini science staff using the same data processing software scripts that form the elemental components of the pipeline. The details of the processing recipe depend on the instrument configuration, telescope and instrument sequencing. Simple sequences have been employed for QuickStart service observing. The observation status in the database is updated after processing.
The final image (or spectrum) derived from off-line data processing is subjected to further QA. For the immediate future, the only PI criteria for overall success is total integration time; longer term the intent is to adopt measurements of precision e.g. flux limit. The image quality of the final stacked data may differ from that requested by the PI, even if the individual datasets are in accordance, and is verified. If the criteria are not met the observation may be re-queued (see the next section).
The Gemini criterion is that the observation was executed as defined by the PI i.e. the correct instrument configuration (e.g. filter) and target (co-ordinates) was used. These attributes are assessed by comparing the data headers with the proposal. If the criteria are not met the observation may be re-queued (see the next section).
The process adds further headers to the final QA data product: FINALINT, FINALIQ, PICONFIG, PITARGET all with values of (pass|fail).
The off-line data processing also generates a brief report for the Contact Scientist containing the following information: (a) QA header items added during real-time QA for each raw dataset, (b) QA header items from off-line QA of final product and (c) a summary of off-line data processing noting any difficulties encountered (e.g. flat fielding problems, non-functioning detector quadrant).
The observation status in the database is updated after QA.
The Gemin Contact Scientist (CS) is responsible for approving the distribution and packaging of the data.
If the off-line data QA is passed (i.e. all header values are "pass"), the real-time data QA is passed (i.e. header values are "OK" and "usable") and there are no significant issues with data processing then the data is packaged and distributed to the PI and to the science archive with the normal proprietary period.
Observations will be scheduled to be repeated if there was significant error by the Gemini Staff, technical failure or a substantial change in the observing conditions during execution. Examples of "significant error" are incorrect instrument configuration (wrong filter, grating, slit etc.) or the target is not in the field of view. A "substantial change" in the observing conditions means that the conditions degraded by at least 50% from those requested during the majority of the observation. An example would be a delivered image quality of 0.45 arcsec when 0.3 arcsec was requested.
The value of 50% is set to provide some margin during early ("shared-risks") science operations. The intent is to reduce this value to zero as experience with the telescope, instrument and environmental conditions are obtained.
The decision on whether an observation passes this criteria is made by the Gemini CS for that program, in consultation with the Instrument Specialist and Observer if necessary, but does not involve the PI.
An observation that was defined incorrectly by the PI will not be repeated.
As is the case for any queue-mode programs, there is no guarantee that a re-scheduled observation will be repeated. The re-scheduled observation will have the same weighting in the queue as the original observation.
Feedback is added into the description of the science program so that the problem can be avoided when it is re-observed.
When an observation is scheduled to be repeated, the data from the original observation will be distributed to the PI and to the science archive with the normal proprietary period. The time used will be charged to a separate category of "repeated observations". This comes off the top before balancing the partner shares. The aggregated "repeated observations" account is one metric of Observatory performance and will be included in the overall report of time usage to the ITAC, GSC, CGO and Gemini Board.
In all cases the status of the observation in the database is updated.
Last update September 22, 2000; Phil Puxley