1 Introduction

Automatic Text Summarization is a fundamental task in the field of Natural Language Processing (NLP), essential for handling large amounts of information. This task aims to condense information from one or more documents, producing summaries that preserve the main points of the original content [13]. Summarization can be classified into two main types: extractive and abstractive. Extractive summarization selects sentences or phrases directly from the original text(s), while abstractive summarization generates new sentences that capture the original meaning of the text(s) [17]. Additionally, summarization can be categorized as single-document, when summarizing a single text, or multi-document, when integrating information from multiple texts [7].

More recently, Large Language Models (LLMs) have been employed for several NLP applications, including generation tasks such as text summarization [1]. Despite their popularity, LLMs present an explainability deficit and have some limitations, such as the expensive computational infrastructure they require and the occurrence of hallucinations, which, in the case of summarization, may undermine the goal of preserving the original meaning of the texts. On the other hand, extractive summarization methods generally do not suffer from these disadvantages. Interestingly, there has been a resurgence of interest in these methods, with the proposal of novel methods that combine classical ideas with new perspectives and also bring new approaches to the task.

Various extractive methods have been proposed and tested mainly on English corpora like CNN/DM [9]. Some notable methods are PreSumm [12], HiStruct+ [20], RankSum [10], MemSum [8] and MatchSum [22], which have demonstrated promising performances in generating summaries. However, there is a significant gap in the application of these methods to other languages and, therefore, limited evaluation of their multilingual capacity. Given the multilingual nature of the web, multilingual summarization is a highly relevant field, and there is a growing need for resources and techniques that support multiple languages, expanding the applicability of NLP advances to diverse linguistic communities.

Evaluation in summarization (whether it is multilingual or not) is crucial for advancing research in the field. Traditionally, the ROUGE (Recall-Oriented Understudy for Gisting Evaluation) metric [11] has been widely used to measure the overlap of n-grams between automatic summaries and human reference summaries. However, ROUGE is limited since it only computes superficial text similarity. To address these limitations, other metrics have arisen, such as BLANC [21]. BLANC evaluates summaries by using them to help pre-trained language models (such as BERT [5]) perform the token unmasking task, under the assumption that a “more helpful” summary improves the model’s success on the task.

In this context, this work aims at evaluating the effectiveness of some of the more recent extractive methods for Brazilian Portuguese, assessing their multilingual performance. We run selected state-of-the-art and more classical methods on well-known reference news corpora for English and Brazilian Portuguese, and compare the results using both the ROUGE and BLANC metrics, in order to provide a more comprehensive evaluation.

The next section briefly describes the main related work. The datasets that we use are detailed in Sect. 3. Section 4 presents our experimental setup, while the discussion of results and final remarks are given in Sects. 5 and 6, respectively.

2 Related Work

Several automatic extractive summarization methods have been developed, especially using deep learning models and word embeddings. Here, we briefly overview classical and more recent methods, including those that are considered state-of-the-art in the area.

TextRank [14] is a graph-based ranking algorithm used for both sentence and keyword extraction. Inspired by Google’s PageRank algorithm, TextRank applies the idea of “voting” to determine the importance of textual units. It constructs a graph where nodes represent sentences or words, and edges indicate co-occurrence or similarity relations. The algorithm iterates over the graph until the importance scores of the nodes converge. TextRank is an unsupervised method that has proven competitive in established benchmarks like DUC-2002, demonstrating robustness and portability across different domains and languages.
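As a minimal sketch of the idea (not the authors’ original implementation), the following code builds a sentence graph weighted by word overlap, normalized by sentence lengths as in the original paper, and runs PageRank-style score updates for a fixed number of iterations:

```python
import math
import re

def textrank(sentences, damping=0.85, iterations=50):
    """Minimal TextRank sketch: similarity graph + PageRank-style iteration."""
    # Tokenize each sentence into a set of lowercase words.
    tokens = [set(re.findall(r"\w+", s.lower())) for s in sentences]
    n = len(sentences)
    # Edge weights: word overlap normalized by the log of sentence lengths.
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j and len(tokens[i]) > 1 and len(tokens[j]) > 1:
                overlap = len(tokens[i] & tokens[j])
                weights[i][j] = overlap / (math.log(len(tokens[i])) + math.log(len(tokens[j])))
    # Iterate the weighted PageRank update; scores converge for small graphs.
    scores = [1.0] * n
    for _ in range(iterations):
        new_scores = []
        for i in range(n):
            rank = 0.0
            for j in range(n):
                out = sum(weights[j])
                if weights[j][i] > 0 and out > 0:
                    rank += weights[j][i] / out * scores[j]
            new_scores.append((1 - damping) + damping * rank)
        scores = new_scores
    return scores
```

The top-scoring sentences would then be selected, in document order, to form the extract.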

Centroid-embedding [19] summarization leverages the compositional capabilities of word embeddings to capture semantic relationships between words and sentences. This method constructs a centroid vector, which represents the central theme of the document, by summing the embeddings of the most significant words, determined by their TF-IDF scores. Each sentence is then represented as a sum of its word embeddings, and sentences closest to the centroid vector are selected for the summary. This approach addresses the limitations of traditional bag-of-words models, which often fail to capture semantic similarities between sentences with different words but similar meanings. The centroid-based method has proven effective in both multi-document and multilingual summarization tasks, offering competitive performance compared to more complex deep learning models. Its simplicity and robustness make it an attractive option for extractive summarization across various domains and languages.
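The selection step can be sketched as follows. This is an illustrative simplification, not the original implementation: it uses raw term frequency in place of TF-IDF (which degenerates for a single document) and a small hypothetical embedding table, whereas the original method relies on pre-trained word embeddings:

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity between two dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def vec_sum(vectors, dim):
    """Component-wise sum of a list of vectors."""
    out = [0.0] * dim
    for v in vectors:
        for k in range(dim):
            out[k] += v[k]
    return out

def centroid_summary(sentences, embeddings, dim, top_k_words=3, num_sentences=1):
    """Select the sentences whose embedding sums lie closest to the centroid."""
    # Score words by document frequency (stand-in for TF-IDF in this sketch).
    words = [w for s in sentences for w in s.lower().split()]
    counts = Counter(w for w in words if w in embeddings)
    top_words = [w for w, _ in counts.most_common(top_k_words)]
    # Centroid = sum of the embeddings of the most significant words.
    centroid = vec_sum([embeddings[w] for w in top_words], dim)
    # Each sentence = sum of its word embeddings; rank by similarity to centroid.
    scored = []
    for idx, s in enumerate(sentences):
        svec = vec_sum([embeddings[w] for w in s.lower().split() if w in embeddings], dim)
        scored.append((cosine(svec, centroid), idx))
    best = sorted(scored, reverse=True)[:num_sentences]
    return [sentences[idx] for _, idx in sorted(best, key=lambda t: t[1])]
```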

PreSumm [12], also known as BERTSUM, has extractive and abstractive summarization strategies. It uses the BERT architecture to understand the context of sentences and select the most relevant ones for the summary. The technique includes pretraining on general language tasks and fine-tuning specific to the summarization task. PreSumm performs summarization in two phases: (1) an extraction phase where the most important sentences are selected, and (2) an abstractive generation phase where these sentences can be refined to improve the fluency and cohesion of the final summary. For this work, we used only the extractive strategy of PreSumm.

HiStruct+ [20] incorporates hierarchical structure information into pre-trained language models like BERT and RoBERTa to improve extractive summarization. This model is designed to handle the intrinsic structure of documents, considering the hierarchy of sections and subtitles when selecting sentences for the summary. HiStruct+ is particularly effective for long and complex documents with clear hierarchical structures, such as scientific articles. The approach improves summarization accuracy by preserving the logical and semantic structure of the documents.

Other relevant methods are RankSum [10], MemSum [8] and MatchSum [22], but we do not address them in this paper because they have been outperformed by some of the above methods in the literature and also in some of our initial experiments (that we do not report in this paper).

3 Datasets

For our experiments for Brazilian Portuguese, we adopt the CSTNews corpusFootnote 1 [2]. It is a reference and widely known corpus developed to support research in automatic summarization of journalistic texts in Brazilian Portuguese. The corpus comprises 50 clusters of news articles, totaling 140 texts and their respective summaries. Each cluster contains 2 to 3 news articles about the same event, collected from various Brazilian media sources such as Folha de São Paulo, Estadão, O Globo, Jornal do Brasil, and Gazeta do Povo, selected according to their impact at the time of publication. The corpus contains 2,088 sentences and 47,240 words, with an average of 2.8 texts, 41.76 sentences, and 944.8 words per cluster. In addition to the original texts, each cluster includes single-document manual summaries, multi-document manual summaries, and automatic summaries. The corpus is also manually annotated in various ways for syntax, semantics and discourse information.

CNN/Daily Mail [9] is a corpus widely used in automatic summarization research for English. It comprises 286,817 article-summary pairs, each consisting of a news article and a human-generated summary in the form of bullet points covering the main points of the article. The articles are sourced from the CNN and Daily Mail news websites. The corpus was created initially for the passage-based question-answering task but was adapted for summarization research by restoring the bullet points to form multi-sentence summaries. In the training set, the source documents have an average of 766 words distributed over 29.74 sentences, while the summaries consist of 53 words distributed over 3.72 sentences. The corpus is available in two versions: one with real entity names and another with anonymized entities that were replaced by document-specific IDs, facilitating vocabulary reduction and experimentation with deep learning models.

4 Experiment Setup

Each corpus was divided into 70% for training, 15% for validation, and 15% for testing (as some methods required training). The original configurations of each summarizer were maintained, adapting them to the CSTNews corpus. The experimental process is summarized in the following subsections.
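A minimal sketch of such a split is shown below; the shuffling and the fixed seed are our illustrative assumptions, not details reported here:

```python
import random

def split_corpus(items, train=0.70, val=0.15, seed=42):
    """Shuffle with a fixed seed and split into 70/15/15 train/val/test portions."""
    items = list(items)
    random.Random(seed).shuffle(items)
    n = len(items)
    n_train = int(n * train)
    n_val = int(n * val)
    return items[:n_train], items[n_train:n_train + n_val], items[n_train + n_val:]
```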

4.1 Pre-processing

The NLTK library was used for tokenization, stopword removal, and normalization of Portuguese texts. This step is crucial to ensure that the models, originally trained in English, can operate effectively in the new language. Tokenization segments the text into smaller units, such as words or subwords. Stopword removal eliminates common words that do not significantly contribute to the task. Normalization adjusts the text to a more consistent form, facilitating subsequent processing. For HiStruct+ and PreSumm, we performed similar pre-processing using the Stanford CoreNLP tool.
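The steps above can be illustrated with a minimal standard-library sketch; in our experiments the actual tokenization and stopword list came from NLTK (e.g., nltk.word_tokenize and nltk.corpus.stopwords.words('portuguese')), and the stopword set below is only a hypothetical miniature:

```python
import re

# Hypothetical miniature stopword list; in practice, use NLTK's Portuguese list.
STOPWORDS_PT = {"a", "o", "as", "os", "de", "do", "da", "em", "um", "uma", "e", "que"}

def preprocess(text, stopwords=STOPWORDS_PT):
    """Tokenize, lowercase (normalization) and remove stopwords."""
    tokens = re.findall(r"\w+", text.lower())
    return [t for t in tokens if t not in stopwords]
```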

4.2 Model Training and Adjustment

We applied the methods’ original codes with minimal modifications. We also fine-tuned the model parameters to optimize their performance on CSTNews. For this, we adapted the tokenization processes and changed some hyperparameters specific to the language and style of the texts in the CSTNews corpus.

We aimed to replicate the original training conditions of the models as closely as possible despite our limitations. The model used in PreSumm was BERT-base-uncased, while HiStruct+ employed RoBERTa-base, as these were the models used in the original works.

Since we only had access to a CPU, the number of epochs and batch sizes had to be adjusted. PreSumm and HiStruct+ were trained for 7 epochs with a batch size of 14.

The learning rates were the same as in the original works: 2e-3 for PreSumm and HiStruct+.

4.3 Evaluation

The generated summaries were evaluated using ROUGE and BLANC. ROUGE measures the overlap of n-grams between the automatic and human reference summaries. The most commonly used variants are ROUGE-1, which measures unigram (individual word) overlap, ROUGE-2, which measures bigram (word pair) overlap, and ROUGE-L, which measures the longest common subsequence overlap. Each variant can be evaluated in terms of precision, recall, and F1 score. Precision measures the proportion of n-grams in the generated summary that are present in the reference summary, while recall measures the proportion of n-grams in the reference summary that are present in the generated summary; the F1 score is the harmonic mean of the two. Although ROUGE is widely used for its simplicity and effectiveness, it is criticized for not adequately capturing the semantic quality and coherence of summaries, focusing mainly on superficial text similarity.

Conversely, BLANC measures how much the generated summary aids a pre-trained language model (like BERT) in understanding the document. It uses the masked token task (Cloze task) to evaluate the functional utility of a summary. In BLANC-help mode (which is the one we used), the summary is concatenated to each document sentence during inference, and the model’s ability to predict masked words with the summary is measured. The difference in prediction accuracy with and without the summary indicates the summary’s quality: a higher BLANC score means the summary provides more contextual information that helps the model understand the document better. Unlike ROUGE, BLANC does not require human-written reference summaries.
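As an illustration of how ROUGE-N is computed (a simplified sketch; published results typically use the official ROUGE toolkit, with stemming and other options):

```python
from collections import Counter

def rouge_n(candidate, reference, n=1):
    """Compute ROUGE-N precision, recall and F1 from clipped n-gram overlap."""
    def ngrams(tokens, n):
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Clipped counts: each n-gram is matched at most as often as it occurs in both.
    overlap = sum((cand & ref).values())
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1
```

For example, a candidate that reproduces part of the reference verbatim scores perfect unigram precision but lower recall, since it misses part of the reference content.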

5 Results and Discussion

Tables 1 and 2 show the ROUGE results for the summarization methods on the CSTNews and CNN/DM corpora. For CSTNews, PreSumm demonstrates the highest performance in most ROUGE metrics, particularly in ROUGE-1 F1 (55.55), ROUGE-1 Precision (67.91), and ROUGE-2 Precision (49.64). This indicates its high effectiveness in selecting relevant sentences that closely match the reference summaries at the unigram and bigram levels. However, its lower recall scores suggest that, while maintaining high precision, it may miss some relevant content. For the CNN/DM dataset, PreSumm also shows the highest performance, with a ROUGE-1 score of 53.37. TextRank and Centroid-embedding have lower scores (30.14 and 18.12, respectively), consistent with their lower performance on the CSTNews corpus and demonstrating the need for more advanced models to handle complex text summarization tasks.

Table 1. Results on ROUGE metrics for CSTNews dataset
Table 2. Results on ROUGE metrics for CNN/DM dataset
Table 3. Results on BLANC metric for CSTNews and CNN/DM

Table 3 presents the BLANC results for the same models. Centroid-embedding achieves the highest BLANC score (57.74) for CSTNews. Despite its high ROUGE scores, PreSumm produces the lowest BLANC score, which shows the importance of using complementary metrics for a more holistic evaluation. For the CNN/DM dataset, TextRank achieves the highest BLANC score (26.20). Centroid-embedding also performs well, with a score of 24.35. PreSumm again shows a relatively low BLANC score.

As an illustration, Table 4 shows a summary generated by PreSumm, allowing a comparison with the human-written summary. The summary generated by PreSumm accurately captured Fabiana Murer’s victory and the matching of the medal record, but omitted details about the competitors and the achieved marks.

Table 4. Comparison between summary generated by PreSumm and the human-written summary

Table 5 shows three CSTNews summaries generated by PreSumm, the summarizer which achieved the best ROUGE results. These summaries were randomly chosen for a manual human evaluation. We used some of the known TAC (Text Analysis Conference) criteria [4], which include overall responsiveness and readability. The assessment of overall responsiveness examines how effectively a summary addresses the information need outlined in the topic statement, considering both the content and linguistic quality of the summary. The readability score evaluates the summary’s fluency and structure, independent of content, and is based on factors such as grammatical correctness, lack of redundancy, referential clarity, focus, structure, and coherence. These criteria are evaluated on a five-point scale: from very poor to very good.

We evaluated these three random summaries as “very good” based on the above criteria, since they show no redundancy, no referential problems, and very good structure and coherence. Their high ROUGE scores can reflect the responsiveness of the summaries: since the generated summaries are close to the gold standard, that is, the human-written summaries, they are expected to cover the document’s main content and thus have very good responsiveness.

Table 5. Three CSTNews summaries generated by PreSumm

Table 6 shows three CSTNews summaries generated by Centroid-embedding, the summarizer which achieved the best BLANC results. These summaries were also randomly chosen for a manual human evaluation under the TAC criteria. Similarly to the previous summaries, we also evaluated these three as “very good” regarding overall responsiveness and readability.

Table 6. Three CSTNews summaries generated by Centroid-embedding

These results raise an important question: why did Centroid-embedding achieve such high BLANC scores? Since BLANC measures how much the generated summary aids a pre-trained language model in understanding the original document, the high BLANC scores suggest a high amount of informational content in the summaries. However, based on these three randomly chosen summaries, we could not hypothesize why they had higher BLANC scores than those of PreSumm, for example. Thus, we encourage further research and new experiments with both the ROUGE and BLANC metrics.

While providing valuable insights into the performance of various summarization models on the CSTNews corpus, the present study is subject to several methodological limitations. One significant limitation is the size of the dataset. Although a reference in the area, the CSTNews corpus contains only 140 texts, and more texts than this limited number are needed for training robust summarization models. We therefore emphasize the need for more resources in languages other than English, to enhance the reliability and generalizability of summarization models. We are aware of a recent corpus, RecognaSumm [15], which contains over 130,000 journalistic texts in Brazilian Portuguese, and we plan to perform experiments with it in the future.

Another limitation of this study is the computational resources used. Due to the unavailability of GPU resources, the experiments were conducted using CPUs. The lack of GPU access limited our ability to fine-tune model parameters optimally, potentially impacting the performance of the summarizers. With the availability of GPUs, future studies could perform more extensive parameter tuning, leading to better results.

More efforts are also needed to refine and develop evaluation metrics beyond traditional measures like ROUGE. Metrics such as BLANC, which supposedly consider semantic coherence and fluency, should be further explored and validated. Developing new metrics that can more accurately reflect the quality of summaries in different languages and contexts will be crucial for advancing the field.

6 Final Remarks

We contributed to the investigation of multilingual summarization by evaluating state-of-the-art extractive summarization models on the Brazilian Portuguese CSTNews corpus and the English CNN/DM corpus. Our study utilized ROUGE and the more recently developed BLANC metric, providing insights into the models’ performance. The findings revealed that PreSumm performed well according to ROUGE metrics, while Centroid-embedding had good BLANC scores for both languages.

Future work includes testing other methods for these languages, as well as for other languages, including ones of different linguistic typologies. We also aim at using corpora of diverse genres and domains. Such evaluation is necessary for determining the potential of current and future summarization methods for multilingual summarization.

Another relevant issue consists in exploring multilingual multi-document summarization, leveraging previous research and datasets that exist for Brazilian Portuguese, such as those of [3] and [6]. Exploring the multilingual performance of some classic single- and multi-document methods that were created for Portuguese may be another interesting endeavor, as is the case of GistSumm [16] – the first summarization system for Portuguese – and RSumm [18].

The interested reader may find the datasets, the source codes of the methods and other details about this work at the POeTiSA project website (at https://sites.google.com/icmc.usp.br/poetisa).