The Second International Workshop on 
                Data Analysis Patterns in Software Engineering 
                          June 2, 2014 - Hyderabad, India 
                          http://dapse.unbox.org/2014

Data scientists in software engineering seek insight in data collected from
software projects to improve software development. The demand for these data
scientists is growing rapidly and there is already a shortage of them. Data
science is a skilled art with a steep learning curve. To reduce the learning
curve, this workshop will collect best practices in form of data analysis
patterns that lead to meaningful conclusions and can be reused in the context of
similar data. 

In the workshop, we will compile a catalog of such patterns that will help
novice and experienced data scientists to better communicate about data
analysis. The workshop is intended for anyone interested in how to analyze
software engineering data correctly and efficiently in a community accepted way. 

The major themes of this workshop are: big data, business intelligence,
replication of experiments, theory building, automated data analysis, and
their application in software engineering.

SUBMISSIONS

We are interested in patterns used to analyze data related to software
development and maintenance (e.g. project plans, code, bugs, reviews, social
networks) as well as generated with the use of software, (e.g. performance
data, runtime data, usage data, user profiles). 

We solicit papers of 3 pages maximum + one-page index card that summarizes the
proposed data analysis pattern. In their papers, we encourage authors to
describe patterns including the following information:
    * Pattern name: a handle for the pattern
    * Problem: why and when to apply the pattern
    * Solution: how to apply the pattern
    * Consequence: results and trade-offs of applying the pattern, common
      mistakes to be avoided while applying the pattern, etc.
    * Examples: brief summary and/or cite example applications of the pattern in
      literature; if possible, R snippets or Weka code to apply the pattern,
      etc.

To develop the one-page index card, authors should use the DAPSE template
(available in both Word and LaTex formats at
 http://dapse.unbox.org/2014/index-card-templates.zip).

Authors can submit two types of papers: archival papers and non-archival papers.
The program committee will review both types of papers. Accepted archival papers
will be published in the workshop proceedings and the ACM and IEEE Digital
Libraries. Accepted non-archival papers will be published in the DAPSE web site
only.

Submissions as archival papers undergo a two-stage process: review and feedback
stages. During the review stage, papers will be assessed based on the 1) clarity
of description, 2) singular purpose of the pattern, 3) relevance of the pattern
to address real SE problems, and 4) reusability of the pattern. During the
feedback stage, authors will be contacted and guided to a more mature
understanding of their patterns. Accepted papers will get a positive evaluation
at the end of the whole two-stage process. 

Prior application of the pattern by the authors is preferred but not mandatory.
This workshop is more interested in the mechanics and choice of the data
analysis than in the impact of published results.

All submissions must be in English.  Papers must follow the ICSE 2014 formatting
and submission instructions (http://2014.icse-conferences.org/format).  Each
paper must be accompanied by a pattern index card.  Each contribution must be
submitted electronically as one single PDF file including both paper and index
card, using the submission site hosted by EasyChair
(https://www.easychair.org/conferences/?conf=dapse2014).

It is the desire of the organizers that discussion of research at the workshop
does not preclude publication of closely related material at conferences or
journals. Authors of accepted papers will be able to choose whether to include
their papers in the workshop proceedings.

WORKSHOP FORMAT

Before the workshop, there will be a blog (http://dapse.unbox.org/) to promote 
and discuss accepted patterns. 

The workshop will consist of the following sessions:
    * Lightning session: Authors will present their proposed pattern in
      lightning talks (5-10 minutes). 
    * Discussion session: Groups of participants will discuss the purpose,
      relevance, and reusability of the.  This group work will eventually
      identify pattern types and classify/group patterns.
    * Breakout session: Groups of participants will use the data analysis
      patterns from the previous session to solve data science tasks provided by
      the workshop organizers. The tasks will come from academic research but
      also from industry.

In both discussion and breakout sessions, each group will present their findings
--- applicability and usefulness of patterns --- in a 5 minutes blitz presentation.
The discussion will be run according to the Delphi method
(http://en.wikipedia.org/wiki/Delphi_method) to help participants
to reach a common agreement on the proposed patterns and their structure.

After the workshop, the organizers will propose a follow-up workshop at ISERN'14
meeting. The organizers also plan to submit a paper titled "Analysis Patterns:
Elements of Reusable Data Analysis in SE" to ESEM'15. Selected authors from the
workshop will be invited to contribute to the article.

IMPORTANT DATES 

Archival Papers:
    Paper Submission:      January 31, 2014 (extended)
    PC Notification:       February 17, 2014
    PC Feedback:           February 18-23, 2014
    Author Notification:   February 24, 2014
    Camera-ready Version:  March 14, 2014
Non-Archival Papers:
    Paper Submission:      April 18, 2014
    Author Notification:   May 5, 2014
Workshop:                  June 2, 2014

PROGRAM COMMITTEE

Alberto Bacchelli, Delft University of Technology, The Netherlands
Anita Sarma, University of Nebraska, USA
Burak Turhan, University of Oulu, Finland
Carolyn Seaman, University of Maryland, USA
Chris Bird, Microsoft Research, USA
Daryl Posnett, University of California, USA
David Lo, Singapore Management University, Singapore
Earl Barr, University College London, UK
Forrest Shull, Fraunhofer Center, USA
Harald Gall, University of Zurich, Switzerland
Lionel Briand, University of Luxembourg, Luxembourg
Martin Shepperd, Brunel University, UK
Massimiliano Di Penta, University of Sannio, Italy
Nachiappan Nagappan, Microsoft Research, USA
Patrick Wagstrom, IBM TJ Watson Research Center, USA
Pete Rotella, Cisco Systems, USA
Rob DeLine, Microsoft Research, USA
Roberto Di Cosmo, INRIA, France
Tim Menzies, West Virginia University, USA
Tom Zimmermann, Microsoft Research, USA 
Tracy Hall, Brunel University, UK
Sandro Morasca, Università dell'Insubria, Italy
Shi Han, Microsoft Research, China
Stefan Wagner, University of Stuttgart, Germany
Vibha Sinha, IBM Research, India
Ye Yang, Chinese Academy of Sciences, China
Yuanfang Cai, Drexel University, USA

ORGANIZERS

General Chair: 
    Barbara Russo, Free University of Bozen-Bolzano, Italy
Program Co-Chairs: 
    Davide Falessi, Fraunhofer CESE, USA
    Venkatesh-Prasad Ranganath, Kansas State University, USA
Publicity and Social Media Co-Chairs: 
    Laura Inozemtseva, University of Waterloo, Canada
    Stefan Wagner, University of Stuttgart, Germany
Webmaster: 
    Rodrigo Rocha Gomes e Souza, Federal University of Bahia, Brazil