Merge pull request #88 from rajdeepsh/main

mrigger · web-flow · commit 2b9177fd38c1 · 2025-02-18T13:33:28.000+08:00
Added ICSE'25 Paper & Updated Profile
diff --git a/content/authors/rajdeep-sh/_index.md b/content/authors/rajdeep-sh/_index.md
@@ -53,7 +53,7 @@ social:
 #   link: https://scholar.google.co.uk/citations?user=sIwtMXoAAAAJ
 - icon: github
   icon_pack: fab
-  link: https://github.com/rajfly
+  link: https://github.com/rajdeepsh
 # Link to a PDF of your resume/CV from the About widget.
 # To enable, copy your resume/CV to `static/files/cv.pdf` and uncomment the lines below.
 # - icon: cv
diff --git a/content/authors/rajdeep-sh/avatar.jpeg b/content/authors/rajdeep-sh/avatar.jpeg
diff --git a/content/authors/rajdeep-sh/avatar.jpg b/content/authors/rajdeep-sh/avatar.jpg
diff --git a/content/post/10-02-25-ICSE25-Mistaken-Assumption/index.md b/content/post/10-02-25-ICSE25-Mistaken-Assumption/index.md
@@ -0,0 +1,6 @@
+---
+title: Our paper "On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations" was accepted at ICSE '25!
+date: 2025-02-10
+---
+
+
diff --git a/content/publication/2025-icse-mistaken-assumption/cite.bib b/content/publication/2025-icse-mistaken-assumption/cite.bib
@@ -0,0 +1,13 @@
+@inproceedings{mistakenassumption,
+author = {Hundal, Rajdeep Singh and Xiao, Yan and Cao, Xiaochun and Dong, Jin Song and Rigger, Manuel},
+title = {On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations},
+year = {2025},
+publisher = {Association for Computing Machinery},
+address = {New York, NY, USA},
+abstract = {Deep Reinforcement Learning (DRL) is a paradigm of artificial intelligence where an agent uses a neural network to learn which actions to take in a given environment. DRL has recently gained traction from being able to solve complex environments like driving simulators, 3D robotic control, and multiplayer-online-battle-arena video games. Numerous implementations of the state-of-the-art algorithms responsible for training these agents, like the Deep Q-Network (DQN) and Proximal Policy Optimization (PPO) algorithms, currently exist. However, studies make the mistake of assuming implementations of the same algorithm to be consistent and thus, interchangeable. In this paper, through a differential testing lens, we present the results of studying the extent of implementation inconsistencies, their effect on the implementations' performance, as well as their impact on the conclusions of prior studies under the assumption of interchangeable implementations. The outcomes of our differential tests showed significant discrepancies between the tested algorithm implementations, indicating that they are not interchangeable. In particular, out of the five PPO implementations tested on 56 games, three implementations achieved superhuman performance for 50% of their total trials while the other two implementations only achieved superhuman performance for less than 15% of their total trials. Furthermore, the performance among the high-performing PPO implementations was found to differ significantly in nine games. As part of a meticulous manual analysis of the implementations' source code, we analyzed implementation discrepancies and determined that code-level inconsistencies primarily caused these discrepancies. Lastly, we replicated a study and showed that this assumption of implementation interchangeability was sufficient to flip experiment outcomes. Therefore, this calls for a shift in how implementations are being used. In addition, we recommend for (1) replicability studies for studies mistakenly assuming implementation interchangeability, (2) DRL researchers and practitioners to adopt the differential testing methodology proposed in this paper to combat implementation inconsistencies, and (3) the use of large environment suites.},
+booktitle = {Proceedings of the IEEE/ACM 47th International Conference on Software Engineering},
+numpages = {13},
+keywords = {reinforcement learning, differential testing},
+location = {Ottawa, Canada},
+series = {ICSE '25}
+}
diff --git a/content/publication/2025-icse-mistaken-assumption/index.md b/content/publication/2025-icse-mistaken-assumption/index.md
@@ -0,0 +1,75 @@
+---
+title: "On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations"
+authors:
+- Rajdeep Sh
+- Yan Xiao
+- Xiaochun Cao
+- Jin Song Dong
+- Manuel Rigger
+date: "2025-05-01T00:00:00Z"
+doi: ""
+
+# Schedule page publish date (NOT publication's date).
+publishDate: "2025-02-08T00:00:00Z"
+
+# Publication type.
+# Legend: 0 = Uncategorized; 1 = Conference paper; 2 = Journal article;
+# 3 = Preprint / Working Paper; 4 = Report; 5 = Book; 6 = Book section;
+# 7 = Thesis; 8 = Patent
+publication_types: ["1"]
+
+# Publication name and optional abbreviated publication name.
+publication: In *Proceedings of the 47th International Conference on Software Engineering*
+publication_short: In *ICSE 2025*
+
+# abstract: Database systems are widely used to store and query data. Test oracles have been proposed to find logic bugs in such systems, that is, bugs that cause the database system to compute an incorrect result. To realize a fully automated testing approach, such test oracles are paired with a test case generation technique; a test case refers to a database state and a query on which the test oracle can be applied. In this work, we propose the concept of Query Plan Guidance (QPG) for guiding automated testing towards "interesting" test cases. SQL and other query languages are declarative. Thus, to execute a query, the database system translates every operator in the source language to one of potentially many so-called physical operators that can be executed; the tree of physical operators is referred to as the query plan. Our intuition is that by steering testing towards exploring diverse query plans, we also explore more interesting behaviors—some of which are potentially incorrect. To this end, we propose a mutation technique that gradually applies promising mutations to the database state, causing the DBMS to create diverse query plans for subsequent queries. We applied our method to three mature, widely-used, and extensively-tested database systems—SQLite, TiDB, and CockroachDB—and found 53 unique, previously unknown bugs. Our method exercises 4.85—408.48× more unique query plans than a naive random generation method and 7.46× more than a code coverage guidance method. Since most database systems—including commercial ones—expose query plans to the user, we consider QPG a generally applicable, black-box approach and believe that the core idea could also be applied in other contexts (e.g., to measure the quality of a test suite).
+# Summary. An optional shortened abstract.
+# summary: Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis posuere tellus ac convallis placerat. Proin tincidunt magna sed ex sollicitudin condimentum.
+
+#tags:
+#- Source Themes
+#featured: true
+
+#links:
+#- name: Custom Link
+#  url: http://example.org
+# url_pdf: https://dl.acm.org/doi/pdf/10.1145/3597503.3623307
+#url_code: '#'
+#url_dataset: '#'
+#url_poster: '#'
+#url_project: ''
+#url_slides: ''
+#url_source: '#'
+#url_video: '#'
+
+# Featured image
+# To use, add an image named `featured.jpg/png` to your page's folder. 
+#image:
+#  caption: 'Image credit: [**Unsplash**](https://unsplash.com/photos/pLCdAaMFLTE)'
+#  focal_point: ""
+# preview_only: false
+
+# Associated Projects (optional).
+#   Associate this publication with one or more of your projects.
+#   Simply enter your project's folder or file name without extension.
+#   E.g. `internal-project` references `content/project/internal-project/index.md`.
+#   Otherwise, set `projects: []`.
+#projects:
+#- sqlancer
+
+# Slides (optional).
+#   Associate this publication with Markdown slides.
+#   Simply enter your slide deck's filename without extension.
+#   E.g. `slides: "example"` references `content/slides/example/index.md`.
+#   Otherwise, set `slides: ""`.
+#slides:
+
+# move below
+#{{% callout note %}}
+#Click the *Cite* button above to demo the feature to enable visitors to import publication metadata into their reference management software.
+#{{% /callout %}}
+
+#Supplementary notes can be added here, including [code and math](https://sourcethemes.com/academic/docs/writing-markdown-latex/).
+
+---
+