Overview
The proliferation of open-source projects has led to large amounts of source code and related artifacts: arguably, the rich and open resources associated with software--including open source repositories, Q/A sites, change histories, and communications between developers--are the richest and most detailed information resource for any technical area. Recently it has been discovered that “natural”, human-produced software has many interesting statistical regularities. As a consequence code corpora, just like natural language corpora, are amenable to statistical modeling, and a number of software tasks such as coding, testing, porting, bug-patching etc are potentially enhanced by the use of these statistical models.
This interdisciplinary workshop will explore issues related to the statistical modeling of software corpora, including topics such as: modeling repetitiveness in source code; use of language models for the code suggestion in IDEs; using probabilistic grammars to mine programming idioms; statistical methods for type inference in a dynamically typed languages; statistical machine translation for porting applications between programming languages, or “mini-fying”Javascript; using statistical language models to find bugs; or statistical methods for automatic code patching, code summarization, code retrieval, code annotation, or test generation.
The workshop follows several earlier workshops on this topic at Microsoft Research, Dagstuhl event, SIGSOFT FSE, and AAAI.
Call for participation
We invite you to join us in Lake Buena Vista, we have a great schedule of two keynote presentations, and a collection of presentations showcasing the latest work in this area.
Schedule
Program overview
Nov 4, 2018
- 8:30–8:45
- Welcome and Introductions
- 8:45-10:00
-
Rock Lake
- Keynote 1, Marc Brockschmidt, Microsoft Research (Title : "Learning from Code with Graphs" )
- 10:00-10:30
- Lakes Foyer Social Coffee Break
- 10:30-12:00
-
Rock Lake
- Total Recall, Language Processing, and Software Engineering (Long)
- Is "Naturalness" a Result of Deliberate Choice? (Long)
- A Fine-Grained Approach for Automated Conversion of JUnit Assertions to English (Long)
- TestNMT- Function-to-Test Neural Machine Translation (Short)
- 3CAP: Categorizing the Cognitive Capabilities of Alzheimer's Patients in a Smart Home Environment (Short)
- 12:00-13:30
- Lakeview Restaurant (West) Lunch
- 13.30-15:00
-
Rock Lake
- Towards Understanding Code Readability and its Impact on Design Quality (Long)
- Cleaning StackOverflow for use in Machine Translation (Long)
- Natural Language Processing (NLP) Applied on Issue Trackers as Eclipse Bugzilla (Short)
- Mining Monitoring Concerns Implementation in Java-based Software Systems (Short)
- Two Perspectives on Software Documentation Quality in StackOverflow (Short)
- 15:00-15:30
- Lakes Foyer Social Coffee Break
- 15:30-16:30
- Rock Lake
Keynote 2: Satish Chandra, Facebook (Title: "Bringing ML to the Developer")
- 16:45-17:00
- Rock Lake
NL4SE Workshop wrap up
Program Committee
Organizing Committee | |
---|---|
Yijun Yu | The Open University (UK) |
Erik Fredericks | Oakland University |
Prem Devanbu | University of California, Davis |
Program Committee | |
Miltos Allamanis | Microsoft Cambridge (UK) |
Marc Brockschmidt | Microsoft Cambridge (UK) |
Satish Chandra | Facebook, Inc. (USA) |
Premkumar Devanbu | University of California, Davis (USA) |
Erik Fredericks | Oakland University (USA) |
Reihaneh Hariri | Oakland University (USA) |
Abram Hindle | University of Alberta (Canada) |
Mark Marron | MSR, WA (USA) |
Fayola Peters | Lero (Ireland) |
Michael Pradel | TU Darmstadt (Germany) |
Guangzhi Qu | Oakland University (USA) |
Baishakhi Ray | Virginia (USA) |
Thein Than Tun | Open University (UK) |
Bogdan Vasilescu | Carnegie Mellon University (USA) |
Martin Vechev | ETH Zurich (Switzerland) |
Xiaoyin Wang | University of Texas St. Antonio (USA) |
Alistair Willis | Open University (UK) |
Yijun Yu | Open University (UK) |