Commission Regulation (EU) No 1151/2010 of 8 December 2010 implementing Regulation (EC) No 763/2008 of the European Parliament and of the Council on population and housing censuses, as regards the modalities and structure of the quality reports and the technical format for data transmission Text with EEA relevance
Article 1
Subject matter
This Regulation lays down the modalities and structure of the quality reports to be submitted by Member States on the quality of the data they transmit to the Commission (Eurostat) from their population and housing censuses for the reference year 2011, as well as the technical format for data transmission, to fulfil the requirements of Regulation (EC) No 763/2008.
Article 2
Definitions
The definitions and technical specifications set out in Regulation (EC) No 763/2008 and Commission Regulations (EC) No 1201/2009 (1) and (EU) No 519/2010 (2) shall apply for the purpose of this Regulation. The following definitions shall also apply:
‘statistical unit’ means the basic observation unit, namely a natural person, household, family, living quarter, or conventional dwelling;
‘individual enumeration’ means that information on each statistical unit is obtained so that their characteristics can be recorded separately and cross-classified with other characteristics;
‘simultaneity’ means that the information obtained in a census refers to the same point in time (reference date);
‘universality within a defined territory’ means that data are provided for all statistical units within a precisely defined territory. Where statistical units are persons, ‘universality within a defined territory’ means that data are provided which are based on information for all persons that have their usual residence in the defined territory (total population);
‘availability of small-area data’ means the availability of data for small geographic areas and for small groups of statistical units;
‘defined periodicity’ means the capacity to conduct censuses regularly at the beginning of every decade, including the continuity of registers;
‘target population’ means the set of all statistical units in a defined geographical area at the reference date which qualify for reporting on one or more specified topics. The target population includes each valid statistical unit exactly once;
‘estimated target population’ means the best available approximation of the target population. The estimated target population consists of the census population plus under-coverage minus over-coverage;
‘census population’ means the set of statistical units which is factually represented by the census results on one or more specified topics for a specified target population. The data records for the census population are the data records in the data source for the specified target population, including all imputed records and excluding all deleted records. If a data source comprises, as a matter of methodological principle, data records for only a sample of the statistical units in its estimated target population, the census population comprises, in addition to the statistical units in the sample, the complementary set of statistical units;
‘complementary set of statistical units’ means the set of those statistical units that belong to an estimated target population, but about which the data source contains no data records as a result of an applied sampling methodology;
‘coverage assessment’ means a study of the difference between a specified target population and its census population;
‘post-enumeration survey’ means a survey conducted shortly after the enumeration for coverage and content assessment purposes;
‘under-coverage’ means the set of all statistical units that belong to a specified target population, but are not included in the corresponding census population;
‘over-coverage’ means the set of all statistical units that are included in a census population used to report on a specified target population without belonging to that target population;
‘record imputation’ means the assignment of an artificial but plausible data record to exactly one geographical area at the most detailed geographical level for which census data are produced, and the imputation of that data record into a data source;
‘record deletion’ means the act of deleting or ignoring a data record that is included in a data source used to report on a specified target population, but does not report any valid information on any statistical unit within that target population;
‘item imputation’ means the insertion of artificial but plausible information into a data record where the data record already exists in a data source but does not contain this information;
‘data source’ means the set of data records for statistical units and/or events related to statistical units which forms a basis for the production of census data about one or more specified topics for a specified target population;
‘register-based data’ means data that are in or originate from a register;
‘questionnaire-based data’ means data that are originally obtained from respondents by the means of a questionnaire in the context of a collection of statistical data which refer to a specified point in time;
‘register’ means a repository which stores information about statistical units and is directly updated in the course of events affecting the statistical units.
‘record linkage’ means the process of merging information from different data sources by comparing the records for the individual statistical units and merging the information for each statistical unit where the unit to which the records refer is the same;
‘matching of registers’ means a record linkage where all matched data sources are contained in registers;
‘data extraction’ means the process of retrieving census information from information contained in a register and relating to individual statistical units;
‘coding’ means the process of converting information into codes representing classes within a classification scheme;
‘identifying variable’ means a variable in the data records in a data source or any list of statistical units which is used
— to evaluate whether the data source (or list of statistical units) includes no more than one data record for each statistical unit, and/or — for a record linkage.
‘capturing’ means the process by which collected data are put into a machine-readable form;
‘record editing’ means the process of checking and modifying data records to make them plausible while at the same time preserving major parts of these records;
‘generation of a household’ means the identification of a private household according to the household-dwelling concept as defined in the Annex to Regulation (EC) No 1201/2009 under the topic ‘Household status’;
‘generation of a family’ means the identification of a family based on information on whether the persons live in the same household, but with no or incomplete information on family relationships between persons. The term ‘family’ is specified as ‘family nucleus’ in the Annex to Regulation (EC) No 1201/2009 under the topic ‘Family status’;
‘unit no-information’ means the failure to collect any data from a statistical unit that is in the census population;
‘item no-information’ means the failure to collect data on one or more specified topics for a statistical unit that is in the census population, while data on at least one other topic can be collected for that statistical unit;
‘statistical disclosure control’ means the methods and processes applied in order to minimise the risk of disclosing information on individual statistical units while releasing as much statistical information as possible;
‘estimation’ means the calculation of statistics or estimates using a mathematical formula and/or algorithm applied to the available data;
‘coefficient of variation’ means the standard error (square root of the variance of an estimator) divided by the expected value of the estimator;
‘model assumption error’ means an error due to assumptions underlying the estimation and containing uncertainty or lack of detail;
‘data structure definition’ means a set of structural metadata associated with a data set, which includes information about how concepts are associated with the measures, dimensions, and attributes of a hypercube, along with information about the representation of data and related descriptive metadata.
Article 3
Metadata and quality reporting
Article 4
Data sources
Any data source shall be able to contribute information needed to fulfil the requirements of Regulation (EC) No 763/2008, in particular to
— meet the essential features as listed in Article 2(i) of Regulation (EC) No 763/2008 and defined in Article 2 (2) to (6),
— represent the target population,
— respect the relevant technical specifications laid down in Regulation (EC) No 1201/2009, and
— contribute to the provision of data for the programme of statistical data set out in Regulation (EU) No 519/2010.
Article 5
Access to relevant information
At the request of the Commission (Eurostat), Member States shall provide the Commission (Eurostat) with access to any information relevant to the assessment of the quality of the transmitted data and metadata as required by Regulation (EU) No 519/2010, excluding the transmission to and storage at the Commission of any microdata and confidential data.
Article 6
Technical format for data transmission
The technical format to be used for the transmission of data and metadata for the reference year 2011 shall be the Statistical Data and Metadata eXchange (SDMX) format. Member States shall transmit the required data conforming to the data structure definitions and related technical specifications provided by the Commission (Eurostat). Member States shall store the data and metadata for the 2011 reference year until 1 January 2035. Member States shall not be obliged to make changes or revisions to these data after 1 January 2025. Member States choosing to do so shall inform the Commission (Eurostat) about the changes or revisions before they are implemented.
Article 7
Entry into force
This Regulation shall enter into force on the twentieth day following its publication in the Official Journal of the European Union.
This Regulation shall be binding in its entirety and directly applicable in all Member States.
ANNEX I
Background information
The structure of the background information to the population and housing censuses conducted in the Member States for the reference year 2011 comprises the following sections:
1. OVERVIEW
2. DATA SOURCES (5)
3. CENSUS LIFECYCLE
3.2.1.1.Design and testing of questionnaires (including copies of all final questionnaires)
3.2.1.2.Preparation of any address lists, preparation of the field work, mapping, publicity
3.2.1.3.Data collection (including field work)
3.2.2.1.Creation of new registers from the year 2001 onwards (where applicable)
3.2.2.2.Re-design of existing registers from the year 2001 onwards (including changes in the contents of registers, adaptation of the census population, adaptation of definitions and/or technical specifications) (where applicable)
3.2.2.3.Maintenance of the registers (for each register used for the 2011 census), including
— content of the register (registered statistical units and information on the statistical units, any record editing and/or item imputation in the register)
— administrative responsibilities
— legal obligation to register information, incentives for providing truthful information or possible reasons for providing false information
— delays in reporting, in particular legal/official delays, data registration delays, late reporting
— evaluation of and clearance for non-registration, non-deregistration, multiple registration
— any major register revision that affects the 2011 census data, periodicity of register revisions
— stability (comparability of information on the registered population over time) (optional)
— usage, including ‘statistical usage of the register other than for the census’ and ‘usage of the register other than for statistical purposes (e.g. administrative purposes)’
3.2.2.4.Matching of registers (including identifying variable(s) used for record linkage)
3.2.2.5.Data extraction
ANNEX II
Quality-related data and metadata
The quality-related data and metadata about the data sources and topics comprise the items listed below.
1. RELEVANCE
Member States have to report on the adequacy of the data sources, in particular on the impact of any major deviation from the essential features of population and housing censuses and/or from the required definitions and concepts where this seriously impairs the adequate usage of the transmitted data.
The following data have to be provided for
— all geographical areas at the following levels: national level, NUTS 1, NUTS 2,
— all hypercubes (7) and all primary marginal distributions (7) : (1) number of all special cell values ‘not available’ (2) number of special cell values ‘not available’ flagged as ‘unreliable’ (3) number of special cell values ‘not available’ flagged as ‘confidential’ (4) number of numerical cell values flagged as ‘unreliable’
2. ACCURACY
The following information:
— has to be provided for each data source (section 2.1.) and each topic (section 2.2.), referring to person counts (8) and
— may be provided for data sources (section 2.1.) and topics (section 2.2.), referring to counts of statistical units other than persons (optional)
The data as required under point 2.1.1. have to be provided for all geographical areas at the following levels: national level, NUTS 1, NUTS 2. The explanatory metadata as required under point 2.1.2. have to be provided for the national level.
(1)Census population: absolute value and percentage of the estimated target population;
(2)Estimated target population (10): absolute value;
(3)Under-coverage (estimated): absolute value and percentage of the census population;
(4)Over-coverage (estimated): absolute value and percentage of the census population;
(5)Number of all record imputations (11): absolute value and percentage of the census population;
(6)Number of all record deletions (12): absolute value and percentage of the census population;
(7)Additionally, for samples: complementary set of statistical units (13): absolute value;
(8)Number of non-imputed records in the data source for statistical units belonging to the target population: absolute value (14), percentage of the census population (14) , percentage of the estimated target population (15), and percentage of all non-imputed records in the data source (before any record deletion) (16);
(9)additionally, for questionnaire-based data in the data source: (17) unit no-information (before record imputation): absolute value and percentage of the census population.
The explanatory metadata contain descriptions of
— the operation to assess under-coverage and over-coverage, including information on the quality of the estimates for under- and over-coverage,
— any method used to impute or delete records for statistical units,
— any method applied to weigh data records for statistical units,
— additionally for questionnaire-based data in the data source: (17) any measures to identify and limit unit no-information or other measures to correct errors during the collection of data.
The data required under point 2.2.1. have to be provided for all geographical areas at the following levels: national level, NUTS 1, NUTS 2. The explanatory metadata required under point 2.2.2. has to be provided for the national level.
(1)Census population (18): absolute value;
Reading this document does not replace reading the official text published in the Official Journal of the European Union. We assume no responsibility for any inaccuracies arising from the conversion of the original to this format.