Access to individual data in France Michel ISNARD Insee – Head of Legal Affairs 28/10/2013
Download ReportTranscript Access to individual data in France Michel ISNARD Insee – Head of Legal Affairs 28/10/2013
Access to individual data in France Michel ISNARD Insee – Head of Legal Affairs 28/10/2013 Individual data files in France • Public Use Files • Scientific Use Files • Secure Use Files • Specific topics 2 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality – Ottawa 2013 28/10/2013 Public Use Files • On Insee’s website • “Households” data Labour force survey Census data : 2 files One with a localisation at regional level (27 regions in France) and detailed social variables One with a localisation at municipality level and variables with aggregated modalities Some register files • http://www.insee.fr/fr/bases-de-donnees/fichiers-detail.asp In French 3 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Scientific Use files • For researchers with specific documentation for researchers • But : Who is a researcher ? And who is not ? What kind of documentation did they need ? • Statisticians need some help 4 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Scientific Use Files (2) • Réseau Quetelet : French Data Archives Formally created in 2001 But result of a longer cooperation between Insee and some researchers • Disseminates Insee (and other) SUF to French and foreign researchers. • Therefor determines who is a researcher or not. • Help Insee to create a documentation usable by researchers • http://www.reseau-quetelet.cnrs.fr/spip/ 5 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Confidential Files • Long history in France for business data Since 1984 • More recent for Household data Since 2008 • Procedure : Opinion by an external committee : Statistical Confidential Committee Chaired by a judge Participation of representatives of business unions, worker unions and researchers Agreement of Insee Decision by National Archives 6 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Confidential files (2) • Longer procedure than in other countries But probably more acceptable 200 access requests a year • Access Through Genes’s CASD http://www.casd.eu 7 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 How to get data ? • First stop : Réseau Quetelet If SUF enough, get the data • Second stop : Confidentiality Committee secretary and the data producer To see if confidential data will solve the problem • Third stop Confidentiality Committee • Fourth stop CASD 8 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Specific topics • Output checking – My OWN PERSONAL OPINION Is it useful ? Enough ? Efficient ? Will only cope with remote access 9 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Output checking in remote access • Some preliminary remarks : An output file can’t be more informative than the confidential file the researcher is allowed to browse A researcher has already signed a confidentiality clause and could be, depending on national laws, bound by penal responsibility A researcher could easily remember the value of some specific variable and therefore extract it from the safe centre. Who is in charge if there’s a confidentiality break ? The NSI ? The researcher? 10 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Output checking in remote access • OC can’t be effective : If a researcher wants to smuggle ONE specific information outside the secure centre, NSI can’t check. He/She just has to remember it!!! He/She could also makes specific operations to know some confidential data about a group of units. Checking thoroughly all the output of a researcher and are sure there’s no confidentiality breach is not enough You also have to check them with every published output made on the same data 11 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013 Output checking in remote access • OC could be very expensive : Of course, we could have the researchers paying OC, but is it a long term solution ? Specially if 99% of researchers follow strictly confidentiality rules • OC is very dangerous for NSIs : If an individual person or a business happens to know about some confidentiality breach, the NSI in charge of then OC could be accused and confidence could be lost • But we need to have a protection against a complete download of the data : Look at the size of the output Check its form 12 Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality - Ottawa 2013 28/10/2013