TY - JOUR
T1 - A quantitative approach for similarity comparison of the terminologies in standard documents
AU - Meng, Huixing
AU - Setthapitayakul, Kanittha
AU - Li, Yan Fu
N1 - Publisher Copyright:
© IEOM Society International.
PY - 2019
Y1 - 2019
N2 - The application of standards is effective for ensuring the expected function of a system in a specific scenario. A key step of the standard enaction or implementation is to precisely comprehend the terminologies in the standards via literal comparison. Current methods for such comparison usually rely heavily on labor, which is time consuming and error-prone. With the rapid increase in the number of the standard documents, it is essential to develop an approach to automatize the comparison process. In this work, we proposed a methodology for the computerized comparison of the terms and definitions in standards. Based on the standard structures, the methodology is developed in three steps: the PDF (Portable Document Format) text conversion, the terms and definitions extraction, and comparison. (1) According to the PDF types, either scanned or digitally created, we provided corresponding methods for converting the PDF files. (2) Regarding the terms and definitions extraction, we identified content structures and logical elements, and extracted definition sections and terms. (3) For the comparison step, we evaluated the similarities between the terminologies from the semantic and syntactic aspects. Reliability, availability, maintainability, and safety (RAMS) are crucial attributes to evaluate the performance of a system. In the experimental studies, we compared the RAMS standards issued by IEC, IEEE, ISO, and the Society for Automotive Engineering (SAE). The results show that the proposed methodology is capable of evaluating the similarities of the terms and definitions in standards.
AB - The application of standards is effective for ensuring the expected function of a system in a specific scenario. A key step of the standard enaction or implementation is to precisely comprehend the terminologies in the standards via literal comparison. Current methods for such comparison usually rely heavily on labor, which is time consuming and error-prone. With the rapid increase in the number of the standard documents, it is essential to develop an approach to automatize the comparison process. In this work, we proposed a methodology for the computerized comparison of the terms and definitions in standards. Based on the standard structures, the methodology is developed in three steps: the PDF (Portable Document Format) text conversion, the terms and definitions extraction, and comparison. (1) According to the PDF types, either scanned or digitally created, we provided corresponding methods for converting the PDF files. (2) Regarding the terms and definitions extraction, we identified content structures and logical elements, and extracted definition sections and terms. (3) For the comparison step, we evaluated the similarities between the terminologies from the semantic and syntactic aspects. Reliability, availability, maintainability, and safety (RAMS) are crucial attributes to evaluate the performance of a system. In the experimental studies, we compared the RAMS standards issued by IEC, IEEE, ISO, and the Society for Automotive Engineering (SAE). The results show that the proposed methodology is capable of evaluating the similarities of the terms and definitions in standards.
UR - http://www.scopus.com/inward/record.url?scp=85067226777&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85067226777
SN - 2169-8767
VL - 2019
SP - 3254
JO - Proceedings of the International Conference on Industrial Engineering and Operations Management
JF - Proceedings of the International Conference on Industrial Engineering and Operations Management
IS - MAR
T2 - 9th International Conference on Industrial Engineering and Operations Management, IEOM 2019
Y2 - 5 March 2019 through 7 March 2019
ER -