Dataset 354

A database of marine phytoplankton abundance biomass and species composition in Australian waters

Download raw data ↓ Download metadata ↓

Realm: Marine
Climate: Temperate/Tropical
Biome: Multiple ecoregions
Central latitude: -39.978763
Central longitude: 129.903106
Duration: 24 years, from 1928 to 2015

75604 records

734 distinct species

Across the time series Chaetoceros sp is the most frequently occurring species


Samples were mainly collected from Niskin bottles net drops or tows and the Continuous Plankton Recorder (CPR). These are all standard methods of collecting phytoplankton samples 1314 and many are still largely reliant on a phytoplankton manual written in 1978 15. A few samples were collected using automated samplers on moorings 16. The sampling was done via research vessels container ships and small boats by experienced researchers students and volunteers. The majority of the samples were preserved using Lugol?s solution although formalin paraformaldehyde and glutaraldehyde have also been used. Different methods of preservation can affect the condition of the sample and which taxa are well preserved 417. Samples were analysed with standard methods including light microscopy transmission or scanning electron microscopy which are described in Hallegraeff et al. 4. All methodological variations within our phytoplankton database are detailed in the metadata where available and recorded for each data entry. Where available a citation is referenced for each project which gives details on methodologies and limitations from that project (Table 1). There were three stages in data gathering. The first stage was to conduct a literature review of Australian phytoplankton data. Any literature that contained abundance or presence data was digitised and uploaded into the CSIRO maintained Oracle database. The second stage was to scan already existing databases such as the CSIRO data trawler the Ocean Biogeographic Ocean System (OBIS) and the Atlas of Living Australia (AOLA). These repositories only store presence records. Relevant data were selected and uploaded into the database. The third stage was to ask researchers to contribute any other data sets that they had. All data were organised into a standard format and uploaded into the database. Data were then served to and hosted by the AODN. All taxa have been verified as accepted species and given the currently accepted name as defined by the World Register of Marine Species database (WoRMS - If any taxa could not be verified then a second check was done through AlgaeBase ( If this did not verify that taxa as a valid name then the taxonomic level of identification was decreased to a satisfactory level or the entry removed. All abundance values were standardised to cells.L-1 or are given as presence only. Records of the original identifications and units were archived so any records can be checked. Identification of the smaller phytoplankton is often to a coarser taxonomic level as many cannot be distinguished to species using light microscopy. In some studies electron microscopy has been used to determine species but in other studies functional groups have been identified. This data set does not include accessory pigment data which can help resolve these smaller taxa 18 although it can be thought of as a complementary dataset. Over 20 years of pigment data are available in Australia via the AEsOP database ( Cell biovolume is calculated as per Hillebrand et al. 19 following the suggested shape factors for each genera. Size parameters were estimated from measurements taken from Australian samples or Australian references where available420 and other sources where not21-23. In some data sets direct measurements of size classes (e.g. P599 the Australian Continuous Plankton Recorder Survey and P597 the IMOS National Reference Stations) were used in preference to literature values. For some taxa there was insufficient information available to estimate a biovolume these were generally the rare taxa. Rather than estimate a size class without any information available these have been left blank. _____ Data Records: Each data record represents the abundance or presence of a phytoplankton taxa at a certain point in space and time and has been given a unique record identification number P(project_id)_(sample_id)_(record_id). Each data record belongs to a project with each project having a unique identification number Pxxx. A project is defined as a set of data records which have been collected together usually as a cruise or study with the same sampling method and having the same person counting the samples. Metadata ascribed to a project relates to all the data records within that project. Details to identify each separate project are given in Table 1. Each sample within that project has a unique sample_id. The sample_id has not been changed from the original data set to maintain traceability. So these may be duplicated between projects but P(project_id)_(sample_id) will be a unique entity in space and time. Species abundance records within the sample are given a unique record_id. The majority of these projects have been uploaded as part of the collation of data for this database. The IMOS National Reference Stations P599 and Continuous Plankton Recorder Survey P597 data which together constitute half the data in this database are available through the AODN (Data citation 12). Table 1 gives summary information on the project data sets their space time and taxonomic resolutions numbers of samples and records available. Users can select data sets from this information and download as desired through the AODN. _____ Technical Validation: The Phytoplankton Database will provide an extensive resource for phytoplankton researchers although there are some caveats due to the variety of the sampling and analysis protocols. The various sample collection methods infer that abundances might not always be directlycomparable across projects. For example quantitative methods such as bottle sampling (e.g. Project 599) will collect all but the rarest phytoplankton and will include the whole size spectrum whereas semi-quantitative methods such as net sampling are selective and dependent on tow method mesh size and the mix of species present in the water as some species may clog the net and trap smaller species that would otherwise go through the mesh (e.g. Project 509)21. By including all data collected using different methods and including this information as meta-data researchers are able to analyse the relative abundance of each taxa within a project and compare across compatible projects. Metadata includes as much detail as is available about sampling methods and limitations and provides guidance to the users about the potential of each data set. Users should consider collection methods preservation techniques and microscopic limitations when comparing datasets. All datasets have been standardised to taxa/m3 of water except P1070 where the units are taxa per gram of substrate measured. This project collected the phytoplankton by collecting substrates and analysing parts of the substrate. It is included here as the only data set on Gambierdiscus and associated benthic dinoflagellates from Australia which are important to the studies of the ciguatera. All datasets submitted can be interpreted as confirming the presence of those species recorded. In some datasets e.g. time series it is possible also to infer absences assuming that all species are looked for on each sampling occasion. Absences have been included in the data sets where the project information available allowed us the confidence to interpret such absences correctly. The interpretation of absences from other projects is at the discretion of the user. We suggest that a project-by-project approach should be taken. If a taxa is notobserved at all in a project then the absence could be due to the taxa not being present that taxa not being of interest to the analyst or the inability to identify that taxa. Thus a real absence should not be inferred. If a taxa is observed in some samples of a project it can most likely be assumed that the microscopist could identify the taxa and that a real absence may be inferred in samples within that project where the taxa was not marked as present. Some data records were removed when there was ambiguity as to the identification of the taxa i.e. when the taxonomic traceability usually from older sources was confused or whenspelling mistakes make it unclear which one of two species was meant. Species known as freshwater species were removed as the methods used to collect data were not aimed at freshwater environments and the inclusion of the odd records of these species would not be comprehensive or meaningful. Estuarine species were captured and the records kept. Data records with positions on land with an unreal number for abundance or with impossible dates were also removed or converted to presence records. ________ Usage Notes: The database contains information on the functional group of the species which can aid analysis. Functional groups include diatoms dinoflagellates flagellates ciliates (including tintinnids) silicoflagellates and cyanobacteria. Once downloaded and binned as required data are suitable for use in the creation of ecological indicators. For example: ? Total diatoms dinoflagellates ? Diatom:Dinoflagellate ratio ? Total phytoplankton abundance or biomass per degree square. Abundance (cells.L-1) for the phytoplankton counts is given where it is available providing more information about the productivity of an area than presence data alone. A low cell abundance may indicate a low level or production but this may not be the case if these are large cells. The biomass data helps to show productivity of an area. The biovolume has been calculated for each cell count and when converted to biomass is available for use by modellers and to assist in interpretation of an area?s productivity. An accepted method of converting biovolume to biomass is to assume that the cell has the density of water (1 mm3 .L-1 = 1 mg.L- 1 ) 13. Another useful conversion is to carbon biomass; full methods are readily found in the literature24-26. Table 2 gives details conversions of phytoplankton size data to carbon biomass. Some of the data records are missing dates or coordinates. It was considered useful to keep these records as the presence of a taxa in a location may still be of value. The user may estimate coordinates from the location given and would then also be aware that these wouldnot be the exact coordinates of the sample. Data can be analysed in many different ways and in many software applications (e.g. R Matlab). We include here some figures created in The R Project for Statistical Computing ( to demonstrate some potential uses of the data (Fig 3). In some cases notably project 599 the IMOS National Reference Stations the phytoplankton component of the survey is only a part of the data available. Additional biogeochemical data are available for this data set via the AODN. Some of the projects viz. P479 P599 P597 have corresponding zooplankton data freely available in The Australian Zooplankton Database 27 . These data sets can be matched by the project_id and the sample_id which are consistent across databases. The list of citations referenced in Table 1 will also give users information as to how this data has been previously used from the discrete projects. Niskin bottles net drops or tows and the Continuous Plankton Recorder (CPR) Unit of abundance = CountPerSqM, Unit of biomass = Weight


Davies, C. H., Coughlan, A., Hallegraeff, G., Ajani, P., Armbrecht, L., Atkins, N., Bonham, P., Brett, S., Brinkman, R., Burford, M., Clementson, L., Coad, P., Coman, F., Davies, D., Dela-Cruz, J., Devlin, M., Edgar, S., Eriksen, R., Furnas, M., Hassler, C., Hill, D., Holmes, M., Ingleton, T., Jameson, I., Leterme, S. C., Lønborg, C., McLaughlin, J., McEnnulty, F., McKinnon, A. D., Miller, M., Murray, S., Nayar, S., Patten, R., Pritchard, T., Proctor, R., Purcell-Meyerink, D., Raes, E., Rissik, D., Ruszczyk, J., Slotwinski, A., Swadling, K. M., Tattersall, K., Thompson, P., Thomson, P., Tonks, M., Trull, T.W., Uribe-Palomino, J., Waite, A. M., Yauwenas, R., Zammit, A. & Richardson, A. J. (2016) A database of marine phytoplankton abundance, biomass and species composition in Australian waters. Scientific Data, 3, 160043. http://10.1038/sdata.2016.43.