Record Details

LIBSSH Mirror - Testing Results (GitLab CI) - Version 2

Harvard Dataverse (Africa Rice Center, Bioversity International, CCAFS, CIAT, IFPRI, IRRI and WorldFish)

 
Field Value
 
Title LIBSSH Mirror - Testing Results (GitLab CI) - Version 2
 
Identifier https://doi.org/10.7910/DVN/I1XAVR
 
Creator Prado Lima, Jackson
 
Publisher Harvard Dataverse
 
Description Context

The SSH library (libssh) is an open-source, multiplatform C library implementing the SSHv2 protocol on both the client and server side. The library is designed to let users remotely execute programs, transfer files, use a secure and transparent tunnel, manage public keys, and the like. Libssh is a Highly-Configurable Software System (HCSS) that is statically configurable with the C preprocessor. It is publicly available and hosted on GitLab.


This dataset contains records from the GitLab CI build history of the LIBSSH system. The data were mined in January 2021, and the 10,052 logs identified are available in the zip file named "logs.zip".

Content

This dataset includes records from the period between 2018/04/12 and 2021/01/20. A total of 401 builds were included in the analysis. We discarded build logs that GitLab CI flagged as problematic, such as logs from which information could not be extracted (non-valid build logs), meaning that the test cases did not execute. We identified a total of 281 failures, and 159 builds in which at least one test failed. Moreover, we found 64 unique test cases in the build logs, and the number of test cases executed per build ranged from 1 to 60. The dataset is also organized by system configuration (a.k.a. product variant). In total, we identified 34 variants that failed at least once.


See the LIBSSH_System.pdf file for a summary of the data, variants identified, test case volatility, failures by cycle, test duration, etc.

File Format

The data in the CSV files are separated by semicolons.


The data-filtering.csv files contain detailed information about each variant/system, and features-engineered.csv contains the data parsed for use by a learning approach. The file data-variants.csv contains information across the variants. See the tool GitLabCI-Torrent for further information.
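
Because the CSV files use semicolons rather than commas, a reader has to set the delimiter explicitly. The following is a minimal sketch using Python's standard csv module. The column names and values in the sample are hypothetical placeholders, since this record does not list the actual columns of the files.

```python
import csv
import io


def read_semicolon_csv(source):
    """Parse semicolon-separated text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(source, delimiter=";"))


# Hypothetical in-memory sample; the real column names are not given in this record.
sample = io.StringIO("col_a;col_b\n1;x\n2;y\n")
rows = read_semicolon_csv(sample)
print(len(rows), rows[0]["col_b"])
```

To read one of the dataset files instead of the sample, pass an open file handle, for example open("data-variants.csv", newline="", encoding="utf-8"). Both the file location and the UTF-8 encoding are assumptions here.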
 
Subject Computer and Information Science
 
Contributor Prado Lima, Jackson