The eventdataR package contains several real-life and artificial event logs. Each can be loaded using the data function. The currently available event logs are listed below. More event logs will be added in the future.

Artifical data

Patients

Artificial eventlog about patients arriving in an emergency department of a hospital. This event log was used as the running example in the journal paper entitled Retrieving batch organisation of work insights from event logs.

library(eventdataR)
patients %>% summary
## Number of events:  5442
## Number of cases:  500
## Number of traces:  7
## Number of distinct activities:  7
## Average trace length:  10.884
## 
## Start eventlog:  2017-01-02 11:41:53
## End eventlog:  2018-05-05 07:16:02
##                   handling      patient          employee  handling_id       
##  Blood test           : 474   Length:5442        r1:1000   Length:5442       
##  Check-out            : 984   Class :character   r2:1000   Class :character  
##  Discuss Results      : 990   Mode  :character   r3: 474   Mode  :character  
##  MRI SCAN             : 472                      r4: 472                     
##  Registration         :1000                      r5: 522                     
##  Triage and Assessment:1000                      r6: 990                     
##  X-Ray                : 522                      r7: 984                     
##  registration_type      time                         .order    
##  complete:2721     Min.   :2017-01-02 11:41:53   Min.   :   1  
##  start   :2721     1st Qu.:2017-05-06 17:15:18   1st Qu.:1361  
##                    Median :2017-09-08 04:16:50   Median :2722  
##                    Mean   :2017-09-02 20:52:34   Mean   :2722  
##                    3rd Qu.:2017-12-22 15:44:11   3rd Qu.:4082  
##                    Max.   :2018-05-05 07:16:02   Max.   :5442  
## 

Real-life data

Sepsis

sepsis
## Log of 15214 events consisting of:
## 846 traces 
## 1050 cases 
## 15214 instances of 16 activities 
## 26 resources 
## Events occurred from 2013-11-07 08:18:29 until 2015-06-05 12:25:11 
##  
## Variables were mapped as follows:
## Case identifier:     case_id 
## Activity identifier:     activity 
## Resource identifier:     resource 
## Activity instance identifier:    activity_instance_id 
## Timestamp:           timestamp 
## Lifecycle transition:        lifecycle 
## 
## # A tibble: 15,214 x 34
##    case_id activity lifecycle resource timestamp             age   crp diagnose
##    <chr>   <fct>    <fct>     <fct>    <dttm>              <int> <dbl> <chr>   
##  1 A       ER Regi~ complete  A        2014-10-22 11:15:41    85    NA A       
##  2 A       Leucocy~ complete  B        2014-10-22 11:27:00    NA    NA <NA>    
##  3 A       CRP      complete  B        2014-10-22 11:27:00    NA   210 <NA>    
##  4 A       LacticA~ complete  B        2014-10-22 11:27:00    NA    NA <NA>    
##  5 A       ER Tria~ complete  C        2014-10-22 11:33:37    NA    NA <NA>    
##  6 A       ER Seps~ complete  A        2014-10-22 11:34:00    NA    NA <NA>    
##  7 A       IV Liqu~ complete  A        2014-10-22 14:03:47    NA    NA <NA>    
##  8 A       IV Anti~ complete  A        2014-10-22 14:03:47    NA    NA <NA>    
##  9 A       Admissi~ complete  D        2014-10-22 14:13:19    NA    NA <NA>    
## 10 A       CRP      complete  B        2014-10-24 09:00:00    NA  1090 <NA>    
## # ... with 15,204 more rows, and 26 more variables: diagnosticartastrup <chr>,
## #   diagnosticblood <chr>, diagnosticecg <chr>, diagnosticic <chr>,
## #   diagnosticlacticacid <chr>, diagnosticliquor <chr>, diagnosticother <chr>,
## #   diagnosticsputum <chr>, diagnosticurinaryculture <chr>,
## #   diagnosticurinarysediment <chr>, diagnosticxthorax <chr>, disfuncorg <chr>,
## #   hypotensie <chr>, hypoxie <chr>, infectionsuspected <chr>, infusion <chr>,
## #   lacticacid <dbl>, leucocytes <chr>, oligurie <chr>,
## #   sirscritheartrate <chr>, sirscritleucos <chr>, sirscrittachypnea <chr>,
## #   sirscrittemperature <chr>, sirscriteria2ormore <chr>,
## #   activity_instance_id <chr>, .order <int>

Hospital Log

hospital
## # A tibble: 53 x 7
##    patient_visit_nr activity originator start_ts complete_ts triagecode
##               <dbl> <chr>    <chr>      <chr>    <chr>            <dbl>
##  1              510 registr~ Clerk 9    20/11/2~ 20/11/2017~          3
##  2              512 Registr~ Clerk 12   20/11/2~ 20/11/2017~          3
##  3              510 Triage   Nurse 27   20/11/2~ 20/11/2017~          3
##  4              512 Triage   Nurse 27   20/11/2~ 20/11/2017~          3
##  5              512 Clinica~ Doctor 7   20/11/2~ 20/11/2017~          3
##  6              510 Clinica~ Doctor 7   20/11/2~ 20/11/2017~         NA
##  7              517 Triage   Nurse 17   21/11/2~ 21/11/2017~          3
##  8              518 Registr~ Clerk 12   21/11/2~ 21/11/2017~          4
##  9              518 Registr~ Clerk 6    21/11/2~ 21/11/2017~          4
## 10              518 Registr~ Clerk 9    21/11/2~ 21/11/2017~          4
## # ... with 43 more rows, and 1 more variable: specialization <chr>

Hospital Billing

hospital_billing
## Log of 49951 events consisting of:
## 288 traces 
## 10000 cases 
## 49951 instances of 16 activities 
## 566 resources 
## Events occurred from 2012-12-13 10:13:18 until 2015-12-13 13:55:15 
##  
## Variables were mapped as follows:
## Case identifier:     case_id 
## Activity identifier:     activity 
## Resource identifier:     resource 
## Activity instance identifier:    activity_instance_id 
## Timestamp:           timestamp 
## Lifecycle transition:        lifecycle 
## 
## # A tibble: 49,951 x 25
##    case_id activity lifecycle resource timestamp           actorange actred
##    <chr>   <fct>    <fct>     <fct>    <dttm>              <chr>     <chr> 
##  1 A       NEW      complete  ResA     2012-12-16 19:33:10 <NA>      <NA>  
##  2 A       FIN      complete  <NA>     2013-12-15 19:00:37 <NA>      <NA>  
##  3 A       RELEASE  complete  <NA>     2013-12-16 03:53:38 <NA>      <NA>  
##  4 A       CODE OK  complete  <NA>     2013-12-17 12:56:29 false     false 
##  5 A       BILLED   complete  ResB     2013-12-19 03:44:31 <NA>      <NA>  
##  6 B       NEW      complete  ResA     2012-12-16 19:33:50 <NA>      <NA>  
##  7 B       DELETE   complete  ResC     2013-10-19 12:37:05 <NA>      <NA>  
##  8 C       NEW      complete  ResA     2013-01-13 21:04:24 <NA>      <NA>  
##  9 C       FIN      complete  <NA>     2013-04-17 19:59:43 <NA>      <NA>  
## 10 C       RELEASE  complete  <NA>     2013-04-18 02:30:35 <NA>      <NA>  
## # ... with 49,941 more rows, and 18 more variables: blocked <chr>,
## #   casetype <chr>, closecode <chr>, diagnosis <chr>, flaga <chr>, flagb <chr>,
## #   flagc <chr>, flagd <chr>, iscancelled <chr>, isclosed <chr>, msgcode <chr>,
## #   msgcount <int>, msgtype <chr>, speciality <chr>, state <chr>,
## #   version <chr>, activity_instance_id <chr>, .order <int>

Road Traffic Fine Management

traffic_fines
## Log of 34724 events consisting of:
## 44 traces 
## 10000 cases 
## 34724 instances of 11 activities 
## 16 resources 
## Events occurred from 2006-06-17 until 2012-03-26 
##  
## Variables were mapped as follows:
## Case identifier:     case_id 
## Activity identifier:     activity 
## Resource identifier:     resource 
## Activity instance identifier:    activity_instance_id 
## Timestamp:           timestamp 
## Lifecycle transition:        lifecycle 
## 
## # A tibble: 34,724 x 18
##    case_id activity lifecycle resource timestamp           amount article
##    <chr>   <fct>    <fct>     <fct>    <dttm>               <dbl>   <int>
##  1 A1      Create ~ complete  561      2006-07-24 00:00:00    350     157
##  2 A1      Send Fi~ complete  <NA>     2006-12-05 00:00:00     NA      NA
##  3 A100    Create ~ complete  561      2006-08-02 00:00:00    350     157
##  4 A100    Send Fi~ complete  <NA>     2006-12-12 00:00:00     NA      NA
##  5 A100    Insert ~ complete  <NA>     2007-01-15 00:00:00     NA      NA
##  6 A100    Add pen~ complete  <NA>     2007-03-16 00:00:00    715      NA
##  7 A100    Send fo~ complete  <NA>     2009-03-30 00:00:00     NA      NA
##  8 A10000  Create ~ complete  561      2007-03-09 00:00:00    360     157
##  9 A10000  Send Fi~ complete  <NA>     2007-07-17 00:00:00     NA      NA
## 10 A10000  Insert ~ complete  <NA>     2007-08-02 00:00:00     NA      NA
## # ... with 34,714 more rows, and 11 more variables: dismissal <chr>,
## #   expense <dbl>, lastsent <chr>, matricola <chr>, notificationtype <chr>,
## #   paymentamount <dbl>, points <int>, totalpaymentamount <chr>,
## #   vehicleclass <chr>, activity_instance_id <chr>, .order <int>