Skip to Content

modeling.mdt.ocl

About this document

This document is a R notebook, dynamically created from the numbers extracted on the project. It lists all datasets published for the project, providing basic numbers, figures and a quick summary, and serves as a test case to make sure that all the required data is present and roughly consistent with requirements. All plots and tables are computed from the actual data as provided in the downloads.

To re-execute the document, simply start a R session, load rmarkdown and render the page with the project ID as a parameter:

require('rmarkdown')
render("datasets_report.Rmarkdown", params = list(project_id = "modeling.mdt.ocl"), output_format="html_document")

This website uses the blogdown R package, which provides a different output_format for the hugo framework.

This report was generated on 2021-04-25.

Downloads

All data is retrieved from Alambic, an open-source framework for development data extraction and processing.

This project’s analysis page can be found on the Alambic instance for the Eclipse forge, at https://eclipse.alambic.io/projects/modeling.mdt.ocl.

Downloads are composed of gzip’d CSV and JSON files. CSV files always have a header to name the fields, which makes it easy to import in analysis software like R:

data <- read.csv(file='myfile.csv', header=T)
names(data)

List of datasets generated for the project:

  • Git
    • Git Commits (CSV) – Full list of commits with id, message, time, author, committer, and added, deleted and modifed lines.
    • Git Commits Evol (CSV) – Evolution of number of commits and authors by day.
    • Git Log (TXT) – the raw export of git log.
  • Bugzilla
  • Eclipse Forums
    • Forums Posts (CSV) – list of all forum posts for this project.
    • Forums threads (CSV) – list of all forum threads for this project.
  • Jenkins CI
  • Eclipse PMI
    • PMI Checks (CSV) – list of all checks applied to the Project Management Infrastructure entries for the project.

Git

Git commits

Download: git_commits_evol.csv.gz

data <- read.csv(file=file_git_commits_evol, header=T)

File is git_commits_evol.csv, and has 3 columns for 2241 entries.

data$commits_sum <- cumsum(data$commits)
data.xts <- xts(x = data[,c('commits_sum', 'commits', 'authors')], order.by=as.POSIXct(as.character(data[,c('date')]), format="%Y-%m-%d"))

time.min <- index(data.xts[1,])
time.max <- index(data.xts[nrow(data.xts)])
all.dates <- seq(time.min, time.max, by="days")
empty <- xts(order.by = all.dates)

merged.data <- merge(empty, data.xts, all=T)
merged.data[is.na(merged.data) == T] <- 0

p <-dygraph(merged.data[,c('commits')],
        main = paste('Daily commits for ', project_id, sep=''),
        width = 800, height = 250 ) %>%
      dyRangeSelector()
p


Git log

Download: git_log.txt.gz

File is git_log.txt, and full log has 53853 lines.


Bugzilla

Bugzilla issues

Download: bugzilla_issues.csv.gz

data <- read.csv(file=file_bz_issues, header=T)

File is bugzilla_issues.csv, and has 17 columns for 2173 issues.

Bugzilla open issues

Download: bugzilla_issues_open.csv.gz

data <- read.csv(file=file_bz_issues_open, header=T)

File is bugzilla_issues_open.csv, and has 17 columns for 627 issues (all open).

Bugzilla evolution

Download: bugzilla_evol.csv.gz

data <- read.csv(file=file_bz_evol, header=T)

File is bugzilla_evol.csv, and has 3 columns for 1497 weeks.

Let’s try to plot the monthly number of submissions for the project:

Versions

Download: bugzilla_versions.csv.gz

data <- read.csv(file=file_bz_versions, header=T)

File is bugzilla_versions.csv, and has 2 columns for 20 weeks.

Components

Download: bugzilla_components.csv.gz

data <- read.csv(file=file_bz_components, header=T)

File is bugzilla_components.csv, and has 2 columns for 1 weeks.

data.sorted <- data[order(data$Bugs, decreasing = T),]

g <- gvisColumnChart(data.sorted, options=list(title='List of product components', legend="{position: 'none'}", width="automatic", height="300px"))
plot(g)

Eclipse Forums

Forums posts

Download: eclipse_forums_posts.csv.gz

data <- read.csv(file=file_forums_posts, header=T)

File is eclipse_forums_posts.csv, and has 6 columns for 6874 posts. The evolution of posts

data$created.date <- as.POSIXct(data$created_date, origin="1970-01-01")
posts.xts <- xts(data, order.by = data$created.date)

time.min <- index(posts.xts[1,])
time.max <- index(posts.xts[nrow(posts.xts)])
all.dates <- seq(time.min, time.max, by="weeks")
empty <- xts(order.by = all.dates)

merged.data <- merge(empty, posts.xts$id, all=T)
merged.data[is.na(merged.data) == T] <- 0

posts.weekly <- apply.weekly(x=merged.data, FUN = nrow)
names(posts.weekly) <- c("posts")

p <- dygraph(
  data = posts.weekly[-1,],
  main = paste('Weekly forum posts for ', project_id, sep=''),
  width = 800, height = 250 ) %>%
  dyAxis("x", drawGrid = FALSE) %>%
  dySeries("posts", label = "Weekly posts") %>%
  dyOptions(stepPlot = TRUE) %>%
  dyRangeSelector()
p

The list of the 10 last active posts on the forums:

data$created.date <- as.POSIXct(data$created_date, origin="1970-01-01")
posts.table <- head(data[,c('id', 'subject', 'created.date', 'author_id')], 10)
posts.table$subject <- paste('<a href="', posts.table$html_url, '">', posts.table$subject, '</a>', sep='')
posts.table$created.date <- as.character(posts.table$created.date)
names(posts.table) <- c('ID', 'Subject', 'Post date', 'Post author')

print(
    xtable(head(posts.table, 10),
        caption = paste('10 most recent posts on', project_id, 'forum.', sep=" "),
        digits=0, align="lllll"), type="html",
    html.table.attributes='class="table table-striped"',
    caption.placement='bottom',
    include.rownames=FALSE,
    sanitize.text.function=function(x) { x }
)
10 most recent posts on modeling.mdt.ocl forum.
ID Subject Post date Post author
1837605 Re: Issue with mdt-ocl-Update sites 2021-02-03 06:02:25 8586
1837597 Re: Generate Java code for generic classes 2021-02-03 04:25:42 177965
1837595 Re: Generate Java code for generic classes 2021-02-02 23:14:36 8586
1837589 Re: Issue with mdt-ocl-Update sites 2021-02-02 20:38:34 231317
1837583 Re: Issue with mdt-ocl-Update sites 2021-02-02 19:21:31 8586
1837555 Issue with mdt-ocl-Update sites 2021-02-02 10:20:17 231317
1837540 Re: Generate Java code for generic classes 2021-02-02 04:08:48 177965
1837525 Re: Generate Java code for generic classes 2021-02-01 19:14:48 8586
1837503 Re: Generate Java code for generic classes 2021-02-01 09:38:10 8586
1837493 Generate Java code for generic classes 2021-02-01 05:57:19 177965


Forums threads

Download: eclipse_forums_threads.csv.gz

data <- read.csv(file=file_forums_threads, header=T)

File is eclipse_forums_threads.csv, and has 8 columns for 1513 threads. A wordcloud with the main words used in threads is presented below.

The list of the 10 last active threads on the forums:

data$last.post.date <- as.POSIXct(data$last_post_date, origin="1970-01-01")
threads.table <- head(data[,c('id', 'subject', 'last.post.date', 'last_post_id', 'replies', 'views')], 10)
threads.table$subject <- paste('<a href="', threads.table$html_url, '">', threads.table$subject, '</a>', sep='')
threads.table$last.post.date <- as.character(threads.table$last.post.date)
names(threads.table) <- c('ID', 'Subject', 'Last post date', 'Last post author', 'Replies', 'Views')

print(
    xtable(threads.table,
        caption = paste('10 last active threads on', project_id, 'forum.', sep=" "),
        digits=0, align="lllllll"), type="html",
    html.table.attributes='class="table table-striped"',
    caption.placement='bottom',
    include.rownames=FALSE,
    sanitize.text.function=function(x) { x }
)
10 last active threads on modeling.mdt.ocl forum.
ID Subject Last post date Last post author Replies Views
1106816 Issue with mdt-ocl-Update sites 2021-02-03 06:02:25 1837605 3 373
1106797 Generate Java code for generic classes 2021-02-03 04:25:42 1837597 5 470
1106768 Missing Javadoc 2021-01-29 11:21:09 1837401 5 316
1106594 Get the query result from OCL Java API in the format of XText OCL result 2021-01-13 19:02:03 1836780 1 119
1106081 run OCL on models programmatically 2020-11-27 08:14:09 1835209 3 255
1105573 Class CastException in OCLinEcore generated code 2020-10-16 10:27:22 1833544 3 244
1105003 Looking for OCL Rules for BPMN 2020-08-26 15:56:54 1831690 1 535
1104313 Noob To OCL needs Help with XMI 2020-06-30 20:25:59 1829295 3 659
1104062 eAllContents() or closure of eContents() in OCL 2020-06-08 15:42:50 1828372 1 379
1103746 Problems with OCL validation and derived properties 2020-05-16 18:42:08 1827518 13 1000

Jenkins

Builds

Download: jenkins_builds.csv.gz

data <- read.csv(file=file_jenkins_builds, header=T)

File is jenkins_builds.csv, and has 7 columns for 126 commits.

ID Name Time Result
272 ocl-branch-tests \#272 1.618554e+12 SUCCESS
52 ocl-codegen-tests \#52 1.618745e+12 SUCCESS
50 ocl-compatibility-2018-09 \#50 1.618763e+12 SUCCESS
53 ocl-compatibility-2018-12 \#53 1.618768e+12 SUCCESS
51 ocl-compatibility-2019-03 \#51 1.618770e+12 SUCCESS
50 ocl-compatibility-2019-06 \#50 1.618776e+12 SUCCESS
49 ocl-compatibility-2019-09 \#49 1.618778e+12 SUCCESS
56 ocl-compatibility-2019-12 \#56 1.618783e+12 SUCCESS
52 ocl-compatibility-2020-03 \#52 1.618787e+12 SUCCESS
47 ocl-compatibility-2020-03 \#47 1.616368e+12 SUCCESS


Jobs

Download: jenkins_jobs.csv.gz

data <- read.csv(file=file_jenkins_jobs, header=T)

File is jenkins_jobs.csv, and has 15 columns for 23 commits.

Name Colour Last build time Health report
ocl-branch-tests blue 1.618554e+12 80
ocl-codegen-tests blue 1.618745e+12 100
ocl-compatibility-2018-09 blue 1.618763e+12 100
ocl-compatibility-2018-12 blue 1.618768e+12 100
ocl-compatibility-2019-03 blue 1.618770e+12 100
ocl-compatibility-2019-06 blue 1.618776e+12 100
ocl-compatibility-2019-09 blue 1.618778e+12 100
ocl-compatibility-2019-12 blue 1.618783e+12 100
ocl-compatibility-2020-03 blue 1.618787e+12 100
ocl-compatibility-2020-06 blue 1.618788e+12 100


PMI

PMI Checks

Download: eclipse_pmi_checks.csv.gz

data <- read.csv(file=file_pmi_checks, header=T)

File is eclipse_pmi_checks.csv, and has 3 columns for 17 commits.

checks.table <- head(data[,c('Description', 'Value', 'Results')], 10)

print(
    xtable(checks.table,
        caption = paste('Extract of the 10 first PMI checks for ', 
                        project_id, '.', sep=" "),
        digits=0, align="llll"), type="html",
    html.table.attributes='class="table table-striped"',
    caption.placement='bottom',
    include.rownames=FALSE,
    sanitize.text.function=function(x) { x }
)
Extract of the 10 first PMI checks for modeling.mdt.ocl .
Description Value Results
Checks if the URL can be fetched using a simple get query. https://bugs.eclipse.org/bugs/enter\_bug.cgi?product=OCL OK: Create URL could be successfully fetched.
Checks if the URL can be fetched using a simple get query. https://bugs.eclipse.org/bugs/buglist.cgi?product=OCL&component=Core OK: Query URL could be successfully fetched.
Sends a get request to the given CI URL and looks at the headers in the response (200 404..). Also checks if the URL is really a Hudson instance (through a call to its API). https://ci.eclipse.org/ocl/job/ocl-master/ OK. Fetched CI URL.\\Failed: CI URL is not the root of a Hudson instance.
Checks if the Dev ML URL can be fetched using a simple get query. https://dev.eclipse.org/mailman/listinfo/mdt-ocl.dev OK: Dev ML URL could be successfully fetched.
Checks if the URL can be fetched using a simple get query. http://download.eclipse.org/ocl/doc/6.4.0/ocl.pdf OK: Documentation URL could be successfully fetched.
Checks if the URL can be fetched using a simple get query. http://www.eclipse.org/modeling/mdt/downloads/?project=ocl OK: Download URL could be successfully fetched.
Checks if the Forums URL can be fetched using a simple get query. http://eclipse.org/forums/eclipse.modeling.mdt.ocl OK. Forum \[Eclipse OCL Forum\] correctly defined.\\OK: Forum \[Eclipse OCL Forum\] URL could be successfully fetched.
Checks if the URL can be fetched using a simple get query. http://help.eclipse.org/oxygen/topic/org.eclipse.ocl.doc/help/GettingStarted.html OK: Documentation URL could be successfully fetched.
Checks if the Mailing lists URL can be fetched using a simple get query. Failed: no mailing list defined.
Checks if the URL can be fetched using a simple get query. Failed: no URL defined for plan.