Finish introduction

author: Nguyễn Gia Phong <cnx@loang.net> 2023-12-11 08:30:17 +0900
committer: Nguyễn Gia Phong <cnx@loang.net> 2023-12-11 08:30:17 +0900
commit: 0f6b9207d5380eefb3a99d21a12751a32de63bfb (patch)
tree: 6528b28a3b8532b0afe67ebd787196076a6846bc /intro.tex
parent: 3285b6df66a9c3e0a6502ae75c93085f2d6a0f44 (diff)
download: thesis-0f6b9207d5380eefb3a99d21a12751a32de63bfb.tar.gz
1 files changed, 40 insertions, 4 deletions
diff --git a/intro.tex b/intro.tex
index 0a41d8f..4cb593f 100644
--- a/intro.tex
+++ b/intro.tex
@@ -1,8 +1,44 @@
 \setcounter{page}{1}
 \section{Introduction}
-TODO: mention symbolic execution\\
-TODO: mention \gls{apr}
+Modern software development is largely an iterative and incremental
+process~\cite{incremental}.  Due to practically commonly weak specifications
+and thus incomplete verification, each refinement may involve discovering,
+eliminating, and creating new software defects, or \glspl{bug}.
+Like for other software engineering tasks, efforts have been put into
+automating this debugging process, including detecting and locating
+software faults~\cite{fl-survey}, and generating fixes~\cite{apr}.
 
-This work's main contributions are:\begin{enumerate}
-  \item A tool named \textsc{Psychic}
+Progress on \gls{apr} is greatly hindered by the weak specification issue
+as well~\cite{apr}.  Due to the criteria for correctness being incomplete,
+patches generated may over-fit such test suites, resulting in an incomplete fix
+and/or new regression \glspl{bug}~\cite{overfit}.  For the same reason,
+more than one patch fitting the specification can be found.
+Contemporary \gls{apr} tools do not give insight on the difference
+between these patches, but only a ranking on potential correctness~\cite{apr}.
+
+While this ranking is beneficial in evaluating the tools themselves,
+together with the lack of certainty of correctness, it is an insufficient guide
+for choosing \emph{the} desired patch, if any.  \Gls{apr} users must then
+verify each patch to decide which to apply.  Recognizing the tedious nature
+of this process, we work on methods to highlight the semantic difference
+between patches, in form of \glspl{difftest}.  Existing automation techniques
+for differentiation such as black- or gray-box fuzzing~\cite{nezha}
+and symbolic execution~\cite{shadow} has shown promise on pairs
+of program revisions.  However, there is a lack in precedents in doing so
+at scale like for multiple \gls{apr}-generated patches.
+
+Symbolic execution is observed as one promising direction as it works directly
+with path constraints, which can be combined and manipulated to reveal
+semantic differences.  In this research, we explore symbolic execution
+for mass differential testing with the ultimate goal of making deciding on
+automatically generated patches easier for developers.  The main contributions
+of this work are:
+\begin{enumerate}
+  \item Introduction of semantic difference mining from multiple
+    program revisions as a semi-automated pipeline, for communicating
+    the reasoning behind each patch in form of a decision tree
+  \item A tool named \psychic{} implementing this process for C programs
+    and handling platform specificities for the generalization of application
+  \item Preliminary results of its performance
+    on patches automatically generated for small programs
 \end{enumerate}
author	Nguyễn Gia Phong <cnx@loang.net>	2023-12-11 08:30:17 +0900
committer	Nguyễn Gia Phong <cnx@loang.net>	2023-12-11 08:30:17 +0900
commit	0f6b9207d5380eefb3a99d21a12751a32de63bfb (patch)
tree	6528b28a3b8532b0afe67ebd787196076a6846bc /intro.tex
parent	3285b6df66a9c3e0a6502ae75c93085f2d6a0f44 (diff)
download	thesis-0f6b9207d5380eefb3a99d21a12751a32de63bfb.tar.gz