Data.NSW

NSW Government Data Glossary

Terms A - C 

 
Term | Definition | Source
Accessibility | The ease with which data or information can be retrieved, used and understood. | NSW Government Standard for Data Quality Reporting
Accuracy | The degree to which the data or information correctly describes what they were designed to measure, monitor, or report, i.e., the consistency of data with reality. | NSW Government Standard for Data Quality Reporting
 
Administrative data

Data or information created or collected through:

  • recording transactions or events
  • monitoring a process or service
  • querying another data or information source
  • regulatory activities and audit actions

Administrative data is often collected as a by-product of operations rather than for analytical purposes.

NSW Government Standard for Data Quality Reporting
Agency | A Public Service agency as defined by the Government Sector Employment Act – see also Schedule 1 of the GSE Act.
Aggregate data | Data that has been combined or de-identified, for example so that it no longer identifies specific individuals or locations. | Digital.NSW – Managing Data & Information
Artificial intelligence | Artificial Intelligence, or AI, is intelligent technology, programs and the use of advanced computing algorithms that can augment decision making by identifying meaningful patterns in data. | NSW Government AI Strategy
Artificial Intelligence, machine learning and predictive analysis | The design, management and implementation of processes to enable machines to learn from experience, through processing large amounts of data and recognising patterns in the data, to automate activities or make predictions about the future. | NSW Information Management Framework (IMF)
Business system | Organised collection of hardware, software, supplies, policies, procedures and people, which stores, processes and provides access to an organisation’s business information. | ISO 23081-2.2009 Part 2, Clause 3.3
Closed data | Closed data is restricted and may be sensitive or highly sensitive data. It can only be accessed by a particular business unit (its owner or custodian), or a subset of approved individuals or organisations for the authorised purpose for which they hold the data. Closed data is generally not released outside of the organisation, and in some cases it is not released outside of the business unit. | NSW Data Leadership Group
Cluster | The nine groups into which NSW Government departments, agencies, and state-owned corporations are organised to enhance coordination and provision of related services and policy development (this reflects the Machinery of Government changes effective 1st July 2019). | NSW Government Departments and agencies
Coherence | The degree to which data or information can be compared with itself and other information over time. | NSW Government Standard for Data Quality Reporting
Comparable | Information that can be compared, is similar, or is worthy of comparison. Comparability refers to the extent to which differences between statistics for different places or times can be attributed to real differences between the things being measured. | NSW Government Standard for Data Quality Reporting
Creative Commons

Licensing framework used to facilitate open data practices. NSW Government Open Data Policy supports Creative Commons licensing for government data and information.

Creative Commons Australia
Custodian | See “Data Custodian” in Terms D-F
Customer | A customer of NSW Government is anyone who lives, works, visits or invests in NSW. Customers include people and businesses who want to or are required to interact with Government, including people serving custodial sentences. The principles of customer service can be applied in the regulatory environment.
Cyber security | Measures used to protect the confidentiality, integrity and availability of systems and information. | NSW Cyber Security Policy

Terms D - F

 
Term | Definition | Source
Data | Data is a broad term, the definition of which is heavily impacted by context. Data generally refers to facts and figures that can be represented as numbers, text, graphics, sound or video, as well as how these are interpreted. Data can also take different forms e.g. digital, and can pertain to a range of topics or areas e.g. people, systems and the environment. Data can further be broken down by type or purpose, for example transactional and operational data. | NSW Data & Information Custodianship Policy
Data analytics | The process of manipulating data in different ways with the goal of discovering insights. | Digital.NSW – Managing Data & Information
Data asset | A data asset is a structured collection of data developed for a broad purpose. An enduring data asset (or enduring linked data asset) is a subset of this category, denoting the linkage of a larger range of data that is designed for potentially many purposes and users. An example of this is the NSW Human Services Data Set. A data asset could also include models, methodologies and algorithms.
Data breach | An incident that results in unauthorised access to, modification or disruption of data, applications, services, networks and/or devices by bypassing their underlying security mechanisms. | NSW Cyber Security Policy – Glossary
Data breach | A data breach occurs when there is a failure that has caused or has the potential to cause unauthorised access to your Agency’s data. Although malware, hacking and data theft are usually the first examples of data breaches that come to mind, many breaches are a result of simple human or technical errors rather than malicious intent. | Information and Privacy Commission NSW
Data custodian

The agency, body or position designated with the custody of a specified dataset or information asset. The custodian is responsible for:

  • development, management, care and maintenance of a specified dataset or information asset;
  • ensuring that all legal, regulatory and policy requirements are met in relation to the management of the specified dataset or information asset;
  • determining the conditions for appropriate use, sharing and distribution of the specified dataset or information asset.

This term is often used interchangeably with ‘Data Owner’.
 

NSW Data & Information Custodianship Policy, Appendix: A
Data ethics | An evaluation of data practices with the potential to adversely impact on people and society – in collection, sharing and use. | The Open Data Institute (ODI)
Data governance | Implementation of a set of policies, processes, structures, roles and responsibilities to ensure that an agency’s data is managed effectively and that it can meet its current and future business requirements. | NSW Data Governance Toolkit
Data life cycle | A data life cycle illustrates the stages of data management required over time, from the time of planning and creation to the time that data is either archived or destroyed.
Data management | Data management refers to the activities involved with managing data across the full lifecycle so that it is protected from unauthorised use and inappropriate deletion. Data needs to be appropriately managed from procurement or service design through to creation and final disposal. This includes protection of personal, health and sensitive information, and the prevention of deletion until enabled by legal authorisation. | NSW Data Governance Toolkit
Data owner | This term is often used interchangeably with ‘Data Custodian’. | Information Data Management Framework (IDMF): Appendix B: Terminology
Data quality

Data quality is generally accepted as meaning “fitness for purpose”. The Australian Bureau of Statistics (ABS) Data Quality Framework contains seven characteristics of quality: 

  • Institutional environment
  • Relevance 
  • Timeliness 
  • Accuracy 
  • Coherence 
  • Interpretability 
  • Accessibility 

Data quality is evaluated in terms of how well the characteristics of the data meet the needs or objectives of a user. 

NSW Government Standard for Data Quality Reporting
Data security | What is considered secure data storage, processing, transmission or technology.
Data set (or dataset) | A dataset is an identifiable collection of government-held information or data and associated metadata.
Data sharing | The exchange of data between entities. Restrictions and controls imposed are contingent upon the data’s sensitivity and privacy impact.
De-identified data | Data that no longer contains, or never included, identifiers about a person, such that their identity is no longer apparent or reasonably ascertainable from the data. Re-identification also needs to be either impossible, or extremely difficult. | Adapted from the IPC
Five Safes Framework

The Five Safes Framework was originally developed in the United Kingdom at the Office for National Statistics. The Framework consists of five elements which define independent but related aspects of disclosure risk:

  • Safe Projects 
  • Safe Data 
  • Safe People 
  • Safe Settings 
  • Safe Outputs 

The Five Safes Framework has formed the basis of data protection legislation, policy and guidance for Commonwealth and State jurisdictions: 

Australian Bureau of Statistics (ABS)

Australian Institute of Health and Welfare (AIHW) 

Office of the National Data Commissioner (ONDC) 

New South Wales (NSW): Data.NSW 

South Australia (SA): Public Sector (Data Sharing) Act 2016

Victorian Data Sharing Legislation and Policy 

 

Terms G - L

 
Term | Definition | Source
Health information

As defined in section 6 of the Health Records and Information Privacy Act 2002 (NSW) (HRIP Act):

(a) personal information that is information or an opinion about—

(i) the physical or mental health or a disability (at any time) of an individual, or

(ii) an individual’s express wishes about the future provision of health services to him or her, or

(iii) a health service provided, or to be provided, to an individual, or

(b) other personal information collected to provide, or in providing, a health service, or

(c) other personal information about an individual collected in connection with the donation, or intended donation, of an individual’s body parts, organs or body substances, or

(d) other personal information that is genetic information about an individual arising from a health service provided to the individual in a form that is or could be predictive of the health (at any time) of the individual or of a genetic relative of the individual, or

(e) healthcare identifiers,

but does not include health information, or a class of health information or health information contained in a class of documents, that is prescribed as exempt health information for the purposes of this Act generally or for the purposes of specified provisions of this Act.

Health Records and Information Privacy Act 2002 (NSW) (HRIP Act), section 6
Indigenous data | Indigenous data is information or knowledge, in any format or medium, which is about and may affect Indigenous peoples both collectively and individually.
Indigenous data governance | Indigenous Data Governance is the right of Indigenous peoples to autonomously decide what, how and why Indigenous Data are collected, accessed and used. It ensures that data on or about Indigenous peoples reflects their priorities, values, cultures, worldviews and diversity.
Indigenous data sovereignty | Indigenous Data Sovereignty is a global movement concerned with the right of Indigenous peoples to govern the creation, collection, ownership and application of their data.
Information

Knowledge concerning objects such as facts, events, things, processes or ideas including concepts that within a certain context have a particular meaning. 

Information is data that has been processed into a form (physical, oral or electronic) that is meaningful to the recipient.  

This definition includes, but is not limited to:

  • Raw data 
  • Information that has been produced by combining or adding value to raw data 
  • Images 
  • Audio-visual material 
  • Web content 
  • Records 
  • Metadata, policies and procedures 
  • Methodologies 
  • Models 
  • Dashboards 
  • Analysis 
  • Knowledge 
  • Strategies 
Information Data Management Framework (IDMF): Appendix B: Terminology
Information security | The protection of information and information systems from unauthorised access, use, disclosure, disruption, modification or destruction in order to provide confidentiality, integrity and availability. | NSW Cyber Security Policy - Glossary
Infrastructure data | Data or information relating to the planning, design, construction, operation and maintenance of infrastructure. | Information Data Management Framework: Appendix B: Terminology
Insights | Meaningful and actionable findings emerging from processed data, that can be leveraged to optimise decision-making processes.
Institutional environment | A reporting quality relating to the institutional and organisational factors which may have a significant influence on the effectiveness and credibility of the agency producing the data or information. | NSW Government Standard for Data Quality Reporting
Internet of Things (IoT) | The Internet of Things (IoT) refers to physical devices that are connected to the internet, collecting and sharing data. It is the global network of infrastructure, vehicles, wearable devices, home appliances, medical technologies and other objects that are embedded with electronics, software, sensors and actuators, enabling these ‘things’ to share and exchange data to perform their functions more efficiently and effectively (from NSW IoT Policy Guidance, p.1). | NSW IoT Policy Guidance
Interpretability | A reporting quality relating to the degree to which data or information can be understood, explained and used. | NSW Government Standard for Data Quality Reporting

Terms M - Q

 
Term | Definition | Source
Master data | Master data (or reference data) refers to the consistent set of identifiers and attributes that an organisation relies on to provide context for business transactions (e.g., information on customers, employees, locations, products and services). | NSW Data Governance Toolkit
Memorandum of Understanding (MoU) | A written agreement between two or more parties that defines the working relationship, expectations and responsibilities. MoUs are not usually legally binding on the parties. They are commonly used to clarify arrangements between non-corporate Commonwealth entities. | Australian Government, Department of Finance
Metadata | Data or information that describes, defines and adds meaning to other data, to support its interpretation.
Open data

Open data is the proactive or on demand release of data or insights for public use, in any form and through any channel, with approval from the data custodian/owner prior to its release. It is provided by Governments for general use to support public sector accountability, encourage innovation and to achieve broad community benefits. 


Open data must have low/negligible data sharing risks with governance frameworks and Open data publishing guidelines in place to manage privacy and safety prior to release. Data owners should provide users with descriptions that outline the definition and context of data to ensure data is correctly understood and interpreted. 


Data owners should be aware that open data is able to be used freely and without restrictions. As best practice, open data should have a Creative Commons Attribution Only (CC BY) licence.

NSW Data Leadership Group
Operational data | Any data which captures business activities. This can include compliance and administrative data.

Personal information

(or personally identifiable information)

Information or an opinion (including information or an opinion forming part of a database and whether or not recorded in a material form) about an individual whose identity is apparent or can reasonably be ascertained from the information or opinion.

Privacy and Personal Information Protection Act 1998 (NSW) (PPIP Act), section 4
Personal Information Factor (PIF) tool | The PIF tool is used to assess the risk of identifying an individual if they are not known to be in the datasets. If an individual is known to be in a dataset, the PIF tool provides a measure of the information that could be gained about them by accessing the dataset. The PIF outputs a score showing the distribution of ‘Row Information Gain’ (RIG) values for records in the data set. The highest value RIG defines the PIF for the data set. | More information: Personal Information Factor (PIF) tool
Platform | A system or group of technologies.
Quality assurance checks | A system or series of activities for ensuring the maintenance of proper standards, especially periodic interrogation and sampling of the product. | NSW Government Standard for Data Quality Reporting
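
As a rough illustration of the last step described in the PIF tool entry above (the highest RIG value defines the PIF for the data set), the sketch below assumes the per-row RIG values have already been produced by the tool. The function name `pif_from_rig` and the sample values are hypothetical, and how RIG itself is calculated is not shown here.

```python
def pif_from_rig(rig_values):
    """Return the PIF for a data set, given one RIG value per row.

    Per the glossary entry, the highest RIG value defines the PIF.
    The RIG values themselves are assumed to come from the PIF tool.
    """
    values = list(rig_values)
    if not values:
        raise ValueError("at least one RIG value is required")
    return max(values)


# Hypothetical RIG values, one per record in a data set.
sample_rig = [0.8, 1.5, 3.2, 2.1]
sample_pif = pif_from_rig(sample_rig)
print(sample_pif)
```

Because the maximum is used, a single highly distinctive row is enough to raise the PIF for the whole data set.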

Terms R - Z

 
Term | Definition | Source
Record

Any document or other source of information compiled, recorded or stored in written form or on film, or by electronic process, or in any other manner or by any other means.

State Records Act 1998

Any source of information created, received and maintained as evidence of the transaction of business. Examples include email approvals, outward correspondence, financial transactions in SAP. 

This information can be structured in business systems or reside in unstructured repositories.

 
Relevance | A reporting quality relating to how well data or information meets the needs of the user in terms of the concept(s) measured and the population(s) presented. | NSW Government Standard for Data Quality Reporting
Semi-structured data

Semi-structured data typically contains tags or labels that are used to identify separate data elements, for example:

  • email metadata that enables classification and keyword searching
  • IoT sensor data
  • JSON objects
Infrastructure Data Management Framework (IDMF)
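
To make the semi-structured data entry above more concrete, the following minimal sketch parses two JSON objects whose keys act as the tags or labels identifying each data element. The sensor readings and field names (`sensor_id`, `temperature_c`, `humidity_pct`) are invented for illustration only.

```python
import json

# Two hypothetical IoT sensor readings. Each value is labelled by a key,
# but the records do not share a fixed schema: only one has humidity.
raw = """
[
  {"sensor_id": "s-01", "temperature_c": 21.5},
  {"sensor_id": "s-02", "temperature_c": 19.0, "humidity_pct": 40}
]
"""

readings = json.loads(raw)

# Collect every label that appears in any record.
fields = sorted({key for reading in readings for key in reading})

# A missing label is handled explicitly rather than assumed to exist.
humidity = [reading.get("humidity_pct") for reading in readings]

print(fields)
print(humidity)
```

The labels make individual elements easy to find, while the absence of a pre-defined schema is what distinguishes this from structured data held in a relational table.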
Sensitive data or information

Per the NSW Government Information Classification, Labelling and Handling Guidelines (2020), sensitive information includes:

  • personal information
  • health information
  • information which could be subject to legal privilege
  • commercial-in-confidence information
  • law enforcement information
  • NSW Cabinet information.
NSW Government Information Classification, Labelling and Handling Guidelines (2020)
Sensor | A device which detects or measures a physical property and records, indicates or otherwise responds, including converting that detection into data. Examples of sensor detection/data include temperature, motion, pressure, light, smoke, and other environmental inputs.
Shared data

Shared data is sensitive data that may be shared across government, or with trusted third parties. It may refer to a subset of data relevant to a specific purpose or recipient, with other parts of the wider dataset classified as Open or Closed data. Shared data usually contains sensitive information, so it needs approval from the data custodian/owner for an agreed, authorised purpose that benefits community outcomes, with access controls in place in some scenarios.


Most commonly, shared data is shared by Governments so it can be used to achieve broad community, economic, or environmental benefits. Shared data maintains a level of accessibility to a broader audience while managing privacy and security concerns. Arrangements such as data sharing agreements and licensing arrangements or frameworks, as well as training, are used to ensure safe data sharing and use in line with the data sharing principles such as the ‘Five Safes’ framework.

NSW Data Leadership Group
Spend category | A spend category is the logical grouping of similar expenditure items or services that have been clearly defined on an organisational level. For example, “information technology” may be considered a spend category covering both IT software and hardware.
Structured data | Structured data is comprised of clearly defined data types stored in accordance with a pre-defined schema or model. The most common example of structured data would be data stored in a relational database. Structured data for infrastructure includes geometrical and non-geometrical spatial data. | Infrastructure Data Management Framework (IDMF)
Systems | Software, hardware, data, communications, networks and includes specialised systems such as industrial and automation control systems, telephone switching and PABX systems, building management systems and internet connected devices. | NSW Cyber Security Policy - Glossary
Time Series | A record of activity where data is measured at regular intervals over a period of time (e.g., a monthly unemployment rate). Time series assist understanding of the current situation, enabling the most recent data observations to be placed in a meaningful historical perspective. | NSW Government Standard for Data Quality Reporting
Timeliness

A reporting quality relating to:

  • the time taken between the occurrence of the characteristics/events being measured and the release of the data or information output; and
  • whether the data or information output is sufficiently up-to-date for the user's purpose.

NSW Government Standard for Data Quality Reporting
Transactional data | Transactional data describe an internal or external event or transaction that takes place as an organisation conducts its business. Examples include purchase orders and product sales. These data are typically grouped into transactional records, which include associated master and reference data. | Definitions of Data Categories
Unstructured data | Unstructured data is comprised of data that has no pre-defined format or organisation and is usually not as easily searchable. It is most often categorised as qualitative data and can include formats like audio, video and social media postings. This kind of data typically resides in different databases/locations to structured data, and is difficult to process and analyse using conventional tools and methods such as organisation in relational databases. Finding insights from unstructured data is therefore complex and requires advanced analytics such as artificial intelligence/AI and a high level of technical expertise. | Infrastructure Data Management Framework
User | End consumer of a Data or Information resource; those who use Data or Information for reference, or as input to solve problems and/or make decisions. | NSW Data & Information Custodianship Policy

Change requests

To request the addition of a new term, or a change to an existing definition, please submit your response in the Microsoft Form here.


Last updated 12 Jul 2024