Guidelines for Collecting and Disseminating Open Data
The National Institute of Development Administration (NIDA) was established to provide guidelines for the disclosure of research information by students at NIDA. These guidelines aim to ensure uniformity in the process, including the specification of characteristics for suitable data sets (Datasets) for storage and distribution. They also cover the format of data sets and distribution methods, adhering to the core principles of Open Data as defined by the Digital Government Development Agency (DGA), a public organization. These principles encompass: 1) Availability and Access, 2) Reuse and Redistribution, and 3) Universal Participation.
Characteristics of data sets suitable for publication are as follows:
- The data set format should be in a structured format such as a table or common file formats like CSV, XLS, XLSX, XML, or JSON. This ensures that the published data is fully usable.
- The information should primarily comprise statistical data, such as figures pertaining to solid waste quantities or monthly rainfall. It’s important to present sample data in a tabular format. Data in the form of transactions or lists is particularly suitable for open and free use.
- URLs linking to data should directly lead to the data itself. They should not direct users to an organization’s website or a Facebook page, as these formats are not suitable for data distribution within the system.
However, when publishing data sets, it’s important to consider their utility. The information contained within these data sets must also adhere to various laws, rules, and regulations. This includes ensuring that the data does not include any personal information that could violate the Personal Data Protection Act (PDPA) or any other relevant laws.
Type of data sets (Datasets) stored and distributed on the NIDA Wisdom repository should be as follows:
- CSV: comma-separated values
- XLS: (Microsoft document file Excel
- PDF: Portable Document Format (PDF)
- DOC: (Microsoft Word document file)
- XML: Extensible Markup Language (XML)
- RDF: Resource Description Framework (RDF)
- KML: Keyhole Markup Language
- SHP: Esri Shape file
- ODS: The Open Document Format for Office Applications (ODF)
- KMZ: (KML-Zipped)
- HTML: Hypertext Markup Language (HTML)
- TIFF: Tagged-Image File Format
- JPEG: Joint Photographic Experts Group
Information dissemination steps and processes
- Published datasets must comply with all applicable laws and regulations, including those related to privacy and security.
- Data should be formatted according to established standards, accompanied by a description that provides context for data users.
- To publish datasets, please specify the data owner’s name, faculty, and the thesis or research title, and send them to the NIDA Wisdom Repository at the National Institute of Development Administration via email at [email protected].