Application research on table structure recognition and information extraction in sci-tech academic journals based on visual studio tools for Office technology

Authors

  • Lipeng Wang China Medical University Journal Center, Shenyang 110122, Liaoning Province, China
  • Jie Chen Journal Center of China Medical University, Shenyang 110001, Liaoning Province, China
  • Chunyu Zheng China Medical University Journal Center, Shenyang 110122, Liaoning Province, China
  • Jie Feng China Medical University Journal Center, Shenyang 110122, Liaoning Province, China

DOI:

https://doi.org/10.54844/ep.2023.0412

Keywords:

table, visual studio tools for office, journal, editor

Abstract

The premise of intelligent table processing in Word is to extract the table structure and text information. By using visual studio tools for Office (VSTO) to obtain the extensible markup language (XML) information of the table, the structural relationship of the table and the text format of each cell can be further recognized. Compared with Visual Basic for Applications (VBA) technology, VSTO technology is slower in handling Word, but it has better extensibility and efficiency than VBA. VSTO technology can effectively recognize the structure of the table and extract information, providing possibilities for subsequent intelligent processing.

Downloads

Published

2023-07-31

How to Cite

1.
Wang L, Chen J, Zheng C, Feng J. Application research on table structure recognition and information extraction in sci-tech academic journals based on visual studio tools for Office technology. EP. 2023;1. doi:10.54844/ep.2023.0412

Issue

Section

Digital Publishing