Towards Data Augmentation for Supervised Code Translation

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Binger Chen
  • Jacek Golebiowski
  • Ziawasch Abedjan

External Research Organisations

  • Technische Universität Berlin
  • Amazon.com, Inc.
View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering
Subtitle of host publicationCompanion Proceedings
PublisherIEEE Computer Society
Pages352-353
Number of pages2
ISBN (electronic)9798400705021
Publication statusPublished - 23 May 2024
Event46th International Conference on Software Engineering: Companion, ICSE-Companion 2024 - Lisbon, Portugal
Duration: 14 Apr 202420 Apr 2024

Abstract

Supervised learning is a robust strategy for data-driven program translation. This work addresses the challenge of insufficient parallel training data in code translation by exploring two innovative data augmentation methods: a rule-based approach specifically designed for code translation datasets and a retrieval-based method leveraging unorganized code repositories.

ASJC Scopus subject areas

Cite this

Towards Data Augmentation for Supervised Code Translation. / Chen, Binger; Golebiowski, Jacek; Abedjan, Ziawasch.
Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings. IEEE Computer Society, 2024. p. 352-353.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Chen, B, Golebiowski, J & Abedjan, Z 2024, Towards Data Augmentation for Supervised Code Translation. in Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings. IEEE Computer Society, pp. 352-353, 46th International Conference on Software Engineering: Companion, ICSE-Companion 2024, Lisbon, Portugal, 14 Apr 2024. https://doi.org/10.1145/3639478.3643115
Chen, B., Golebiowski, J., & Abedjan, Z. (2024). Towards Data Augmentation for Supervised Code Translation. In Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings (pp. 352-353). IEEE Computer Society. https://doi.org/10.1145/3639478.3643115
Chen B, Golebiowski J, Abedjan Z. Towards Data Augmentation for Supervised Code Translation. In Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings. IEEE Computer Society. 2024. p. 352-353 doi: 10.1145/3639478.3643115
Chen, Binger ; Golebiowski, Jacek ; Abedjan, Ziawasch. / Towards Data Augmentation for Supervised Code Translation. Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings. IEEE Computer Society, 2024. pp. 352-353
Download
@inproceedings{d06c11f9ac824ef8ada40c016a50d613,
title = "Towards Data Augmentation for Supervised Code Translation",
abstract = "Supervised learning is a robust strategy for data-driven program translation. This work addresses the challenge of insufficient parallel training data in code translation by exploring two innovative data augmentation methods: a rule-based approach specifically designed for code translation datasets and a retrieval-based method leveraging unorganized code repositories.",
author = "Binger Chen and Jacek Golebiowski and Ziawasch Abedjan",
note = "Publisher Copyright: {\textcopyright} 2024 IEEE Computer Society. All rights reserved.; 46th International Conference on Software Engineering: Companion, ICSE-Companion 2024 ; Conference date: 14-04-2024 Through 20-04-2024",
year = "2024",
month = may,
day = "23",
doi = "10.1145/3639478.3643115",
language = "English",
pages = "352--353",
booktitle = "Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering",
publisher = "IEEE Computer Society",
address = "United States",

}

Download

TY - GEN

T1 - Towards Data Augmentation for Supervised Code Translation

AU - Chen, Binger

AU - Golebiowski, Jacek

AU - Abedjan, Ziawasch

N1 - Publisher Copyright: © 2024 IEEE Computer Society. All rights reserved.

PY - 2024/5/23

Y1 - 2024/5/23

N2 - Supervised learning is a robust strategy for data-driven program translation. This work addresses the challenge of insufficient parallel training data in code translation by exploring two innovative data augmentation methods: a rule-based approach specifically designed for code translation datasets and a retrieval-based method leveraging unorganized code repositories.

AB - Supervised learning is a robust strategy for data-driven program translation. This work addresses the challenge of insufficient parallel training data in code translation by exploring two innovative data augmentation methods: a rule-based approach specifically designed for code translation datasets and a retrieval-based method leveraging unorganized code repositories.

UR - http://www.scopus.com/inward/record.url?scp=85194884166&partnerID=8YFLogxK

U2 - 10.1145/3639478.3643115

DO - 10.1145/3639478.3643115

M3 - Conference contribution

AN - SCOPUS:85194884166

SP - 352

EP - 353

BT - Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering

PB - IEEE Computer Society

T2 - 46th International Conference on Software Engineering: Companion, ICSE-Companion 2024

Y2 - 14 April 2024 through 20 April 2024

ER -