Page 29 - 2024
P. 29

Ph.D.
                                                                                 (Engineering & Technology)
          METAHEURISTIC APPROACHES BASED TEXT SUMMARIZATION
          FOR MULTIPLE DOCUMENTS

          Ph.D. Scholar : Patel Praveshkumar Somabhai
          Research Supervisor : Dr. Paresh M. Solanki



                                                                                Regi. No.: 20276341005
          Abstract :
          The  massive  amount  of  textual  information  available  on  the  Internet,  including  news,
          blogs,  websites,  and  user  reviews  continues  to  grow  rapidly  every  day.  Consequently,
          users face significant challenges in finding the specific information they want to find, as it
          is time-consuming to read and understand all the text they encounter in search results.
          Moreover, much of the text contains repeated or irrelevant content. Therefore, there is a
          need for text summarization and shortening text resources has become urgent and much
          more important.

          Multi-document summarization is a more challenging task compared to summarizing a
          single  document.  It  involves  dealing  with  multiple  documents  that  contain  similar  and
          appropriate information on a specific topic or a single document that covers information
          from  various  domains.  The  complexity  arises  from  the  presence  of  conflicting  views,
          biases, and thematic diversity in large document sets. Additional issues include handling
          redundant information across multiple documents, compressing the content effectively
          and  selecting  sentences  efficiently  for  extraction  at  a  reasonable  speed  and  specific
          amounts  of  words  form  multiple  documents.  These  challenges  can  be  addressed  by
          employing  metaheuristic  techniques  in  text  summarization  process  which  plays
          important role to maintain control over relevance and coverage and remove redundancy
          in summary. Therefore, there is a necessity to explore text summarization methods based
          on metaheuristic approaches, as they can generate high-quality summaries from multiple
          documents.  In  the  proposed  metaheuristic-based  text  summarization  framework  for
          multiple  documents,  firstly  identifying  and  implemented  effective  sentence  scoring
          features. Further based on selected sentence features a fitness function is constructed,
          then firefly, cuckoo, ant colony and Teaching-learning based metaheuristic techniques are
          applied  for  generation  of  precise  text  summary.  Finally,  generated  summaries  are
          evaluated  using  ROUGE  and  BLEU  matrices.  Experiment  results  concluded  that
          incorporating  metaheuristic  techniques  in  multiple  documents  text  summarization  are
          outperformed compared to existing approaches of multi-document text summarization.

          Key words: Text Summarization, Natural language processing, Metaheuristic, Summary
          Evaluation, Firefly, Cuckoo Search, Teaching Learning, ROUGE, BLEU.

                                                                                             01
   24   25   26   27   28   29   30   31   32   33   34