Rancang Bangun Aplikasi Peringkasan Dokumen PDF Berbasis Web Menggunakan Teknologi Natural Language Processing

Authors

  • Yeskarwani Gulo Author

Keywords:

Artificial Intelligence, Natural Language Processing, Document Summary, Text Summarization, OpenRouter API, Web Applications, MERN Stack

Abstract

The development of artificial intelligence technology (Artificial Intelligence) especially in the field of Natural Language Processing (NLP) has made a significant contribution in the processing and analysis of digital documents. One of its applications is in the process of automatically summarizing documents that aims to help users get the gist of information from long documents more quickly and efficiently. This research aims to design and implement an artificial intelligence-based web application capable of automatically summarizing and analyzing PDF-formatted documents by utilizing the OpenRouter Application Programming Interface (API) service. The research method used is the Waterfall software development model, which covers the stages of needs analysis, system planning, implementation, testing, and maintenance. The research approach is carried out through literature studies and analysis of user needs to ensure that the developed system is able to answer existing problems. Applications are developed using MERN Stack technology which consists of MongoDB as a database, Express.js and Node.js as backend, and React.js as frontend. Integration with the OpenRouter API allows the system to leverage Transformer-based artificial intelligence models to perform natural language processing effectively. The results of this research show that the developed application is able to extract text from PDF documents, process information using NLP techniques, and produce a summary of documents that is more concise, accurate, and easy to understand by users. With this system, the document analysis process can be done more efficiently so that it helps users understand the content of the document quickly without having to read the entire content of the document.

References

Alfin, M., Abidin, Z., & Safitri, P. M. N. (2024). Peringkasan Multi Dokumen Berbahasa Indonesia Menggunakan Metode Recurrent Neural Network. Techno.Com, 23(1). https://doi.org/10.62411/tc.v23i1.9605

Astuti, R. H., Muljono, M., & Sutriawan, S. (2024). Indonesian News Text Summarization Using MBART Algorithm. Scientific Journal of Informatics. https://doi.org/10.15294/sji.v11i1.49224

Halimah, H., Agustian, S., & Ramadhani, S. (2022). Peringkasan Teks Otomatis pada Artikel Berbahasa Indonesia Menggunakan Algoritma LexRank. CoSciTech. https://doi.org/10.37859/coscitech.v3i3.4300

Hartawan, G., Maylawati, D. S., & Uriawan, W. (2024). Bidirectional and Auto-Regressive Transformer (BART) for Indonesian Abstractive Text Summarization. Jurnal Informatika Polinema, 10(4), 535–542. https://doi.org/10.33795/jip.v10i4.5242

Hidayat, R., & Nugraha, A. (2020). Analisis Metode TF-IDF dalam Peringkasan Teks Bahasa Indonesia. Jurnal Teknologi Informasi dan Komputer.

Kevin Malau, J., Tarigan, D. S., Simangunsong, D. R., Rajagukguk, R. T., & Ginting, A. K. (2026). Implementasi Algoritma Textrank Untuk Peringkasan Teks Berbahasa Indonesia Pada Aplikasi Berbasis Flask. Jurnal Ilmu Komputer dan Informatika, 2(3), 61–66.

Kurnia, Y., Rossi, A., Junaedi, & Hermawan, A. (2025). Machine Learning Framework for Automatic Speech Transcription and Summarization Using HMM and TextRank. Jurnal Sains dan Teknologi, 14(1). https://doi.org/10.23887/jstundiksha.v14i1.94184

Kurniawan, D., & Santoso, H. (2020). Implementasi Natural Language Processing untuk Sistem Peringkasan Dokumen Otomatis. Jurnal Sistem Informasi.

Kusuma, T. B., Widiartha, I. M., & Suputra, I. P. G. H. (2025). Implementasi LexRank dan BERT2GPT dalam Auto Summarization Teks Bahasa Indonesia. Jurnal Nasional Teknologi Informasi dan Aplikasinya, 4(1), 19–26. https://doi.org/10.24843/JNATIA.2025.v04.i01.p03

Lim, V. I., Fitria, F., & Hafid, M. (2024). Implementasi Text Summarization pada Ulasan Aplikasi Mobile JKN Menggunakan TF-IDF dan Cosine Similarity. Konvergensi, 21(1). https://doi.org/10.30996/konv.v21i1.12196.

Pratama, R., & Wibowo, A. (2021). Automatic Text Summarization pada Dokumen Berita Berbahasa Indonesia Menggunakan Metode TextRank. Jurnal Informatika.

Purnama, I. N., & Utami, N. N. W. (2024). Implementasi Peringkas Dokumen Berbahasa Indonesia Menggunakan Metode Text-to-Text Transfer Transformer (T5).

Saputra, M., & Rahman, F. (2020). Penerapan Algoritma TextRank pada Sistem Automatic Text Summarization. Jurnal Informatika Indonesia.

Satya, M. W. B. D., & Luthfiarta, A. (2025). Leveraging BERT and T5 for Comprehensive Text Summarization on Indonesian Articles. Jurnal Sistem Cerdas, 8(2), 191–202. https://doi.org/10.37396/jsc.v8i2.458

Setiawan, A., & Nugroho, Y. (2022). Penerapan Natural Language Processing untuk Peringkasan Dokumen Berbahasa Indonesia. Jurnal Teknologi Informasi dan Ilmu Komputer.

Surniandari, A., Rachmi, H., Fauzi, A., & Utami, L. D. (2024). Comparison of News Text Summarization Using NLTK and TextRank Based on Python Programming. Jurnal Teknik Komputer AMIK BSI, 11(1). https://doi.org/10.31294/jtk.v11i1.12108

Wiratmoko, G., Thamrin, H., & Pamungkas, E. W. (2025). Performance of Machine Learning Algorithms on Automatic Summarization of Indonesian Language Texts. Jurnal Online Informatika, 10(1). https://doi.org/10.15575/join.v10i1.1506

Yani, M., Khodijah, N. S., Rendi, & Mustamiin, M. (2024). Aplikasi Peringkas Teks Bahasa Indonesia Menggunakan Model Text-to-Text Transfer Transformer (T5). IKRA-ITH Informatika, 9(2), 78–86. https://doi.org/10.37817/ikraithinformatika.v9i2.4392

Zamzam, M. A., Holle, C. C., & Holle, K. F. H. (2023). Sistem Automatic Text Summarization Menggunakan Algoritma TextRank. MATICS.

Downloads

Published

2026-04-04