Last active
March 1, 2025 04:52
-
-
Save lifeinchords/7f96e2728707c28d46f597ea65926a61 to your computer and use it in GitHub Desktop.
pdf-toolbox log file output, when running script with sample book PDF in `[project-root]/data`
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2025-02-28 13:51:49,642 - PyMuPDF version: 1.25.2 | |
2025-02-28 13:51:49,649 - Pre-warming OpenAI schema cache... | |
2025-02-28 13:51:49,649 - Pre-warming metadata extraction schema... | |
2025-02-28 13:51:56,977 - Pre-warming filename generation call... | |
2025-02-28 13:51:57,379 - Schema cache pre-warming complete | |
2025-02-28 13:51:57,379 - Testing schema caching behavior... | |
2025-02-28 13:51:57,379 - Testing metadata schema caching (3 identical calls): | |
2025-02-28 13:51:58,068 - Call 1: 0.69 seconds | |
2025-02-28 13:51:58,864 - Call 2: 0.80 seconds | |
2025-02-28 13:51:59,475 - Call 3: 0.61 seconds | |
2025-02-28 13:51:59,476 - Testing filename generation caching (3 identical calls): | |
2025-02-28 13:51:59,819 - Call 1: 0.34 seconds | |
2025-02-28 13:52:00,215 - Call 2: 0.40 seconds | |
2025-02-28 13:52:01,162 - Call 3: 0.95 seconds | |
2025-02-28 13:52:01,165 - `````````````````````````````````` | |
2025-02-28 13:52:01,165 - 1 Total PDFs to process | |
2025-02-28 13:52:01,166 - `````````````````````````````````` | |
2025-02-28 13:52:01,166 - Validating PDF files... | |
2025-02-28 13:52:01,209 - 1 Valid | |
2025-02-28 13:52:01,210 - 0 Invalid | |
2025-02-28 13:52:01,210 - Processing valid files... | |
2025-02-28 13:52:01,210 - `````````````````````````````````` | |
2025-02-28 13:52:01,210 - [1] Starting: /Users/[your-user-name]/Code/pdf-toolbox/data/9789819787272.pdf | |
2025-02-28 13:52:01,238 - [1] Processing front matter pages 1 to 8 | |
2025-02-28 13:52:01,244 - [1] Processing pages 1 to 1 of 9789819787272.pdf | |
2025-02-28 13:52:01,273 - [1] Page 1 analysis: text_length=98, image_ratio=0.29 | |
2025-02-28 13:52:01,273 - [1] Page 1 classified as mixed content | |
2025-02-28 13:52:01,274 - [1] Page type determination took 0.03s | |
2025-02-28 13:52:01,274 - [1] Page 1 determined as mixed | |
2025-02-28 13:52:01,370 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_1_original.png | |
2025-02-28 13:52:01,370 - [1] PDF Debug Information: | |
2025-02-28 13:52:01,373 - [1] Raw font data: (14737, 'cff', 'Type1', 'MyriadPro-SemiboldCond', 'F1', '') | |
2025-02-28 13:52:01,373 - [1] Raw font data: (14738, 'cff', 'Type1', 'MyriadPro-Cond', 'F2', '') | |
2025-02-28 13:52:01,373 - [1] Raw font data: (14739, 'cff', 'Type1', 'VNMOFJ+MyriadPro-CondIt', 'F3', '') | |
2025-02-28 13:52:01,373 - [1] Number of fonts on page: 3 | |
2025-02-28 13:52:01,474 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:01,493 - [1] Successfully extracted 15 words from page 1 | |
2025-02-28 13:52:01,493 - [1] Starting OCR for page 1 | |
2025-02-28 13:52:01,819 - [1] Page 1 original OCR confidence: 37.65 | |
2025-02-28 13:52:02,015 - [1] Page 1 enhanced OCR confidence: 24.47 | |
2025-02-28 13:52:02,016 - [1] Using original image for page 1 (confidence: 37.65 > 24.47) | |
2025-02-28 13:52:02,244 - [1] Saved extracted text to: logs/20250228_135149/9789819787272/9789819787272_page_1_text.png.txt | |
2025-02-28 13:52:02,244 - [1] Extracted 18 words from page 1 | |
2025-02-28 13:52:02,249 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 1) | |
2025-02-28 13:52:04,473 - [1] Successfully extracted structured markdown (87 characters) | |
2025-02-28 13:52:04,474 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_1_structured.png.md | |
2025-02-28 13:52:04,474 - [1] Detected hierarchy: Title='Grey Systems Analysis', Subtitle='Methods, Models and Applications' | |
2025-02-28 13:52:04,474 - [1] Structured extraction for page 1: Title='Grey Systems Analysis', Subtitle='Methods, Models and Applications' | |
2025-02-28 13:52:04,474 - [1] Title page text to parse: | |
2025-02-28 13:52:04,474 - [1] Series on Grey System | |
2025-02-28 13:52:04,474 - [1] Sifeng Liu | |
2025-02-28 13:52:04,474 - [1] Grey Systems | |
2025-02-28 13:52:04,474 - [1] Analysis | |
2025-02-28 13:52:04,474 - [1] Methods, Models and Applications | |
2025-02-28 13:52:04,474 - [1] Second Edition | |
2025-02-28 13:52:04,474 - [1] Series on Grey System | |
2025-02-28 13:52:04,474 - [1] Sifeng Liu | |
2025-02-28 13:52:04,474 - [1] ; Grey Systems | |
2025-02-28 13:52:04,474 - [1] Analysis | |
2025-02-28 13:52:04,474 - [1] Methods, Models and Applications | |
2025-02-28 13:52:04,474 - [1] Second Edition | |
2025-02-28 13:52:04,474 - [1] g) Springer | |
2025-02-28 13:52:04,474 - [1] Processed pages 1 to 1 of 9789819787272.pdf | |
2025-02-28 13:52:04,474 - [1] Processing pages 2 to 2 of 9789819787272.pdf | |
2025-02-28 13:52:04,481 - [1] Page 2 analysis: text_length=332, image_ratio=0.00 | |
2025-02-28 13:52:04,481 - [1] Page 2 classified as text | |
2025-02-28 13:52:04,482 - [1] Page type determination took 0.01s | |
2025-02-28 13:52:04,482 - [1] Page 2 determined as text | |
2025-02-28 13:52:04,498 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_2_original.png | |
2025-02-28 13:52:04,498 - [1] PDF Debug Information: | |
2025-02-28 13:52:04,498 - [1] Raw font data: (4348, 'cff', 'Type1', 'GOJRQX+Times-Bold', 'F1', 'WinAnsiEncoding') | |
2025-02-28 13:52:04,498 - [1] Raw font data: (4349, 'cff', 'Type1', 'FKWFSS+Times-Roman', 'F2', 'MacRomanEncoding') | |
2025-02-28 13:52:04,498 - [1] Number of fonts on page: 2 | |
2025-02-28 13:52:04,499 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:04,500 - [1] Successfully extracted 45 words from page 2 | |
2025-02-28 13:52:04,501 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 2) | |
2025-02-28 13:52:06,633 - [1] Successfully extracted structured markdown (362 characters) | |
2025-02-28 13:52:06,634 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_2_structured.png.md | |
2025-02-28 13:52:06,634 - [1] Detected hierarchy: Title='Series Editors', Subtitle='Sifeng Liu, Institute of Grey Systems Studies, Nanjing University of Aeronautics' | |
2025-02-28 13:52:06,634 - [1] Structured extraction for page 2: Title='Series Editors', Subtitle='Sifeng Liu, Institute of Grey Systems Studies, Nanjing University of Aeronautics' | |
2025-02-28 13:52:06,634 - [1] Title page text to parse: | |
2025-02-28 13:52:06,634 - [1] Series on Grey System | |
2025-02-28 13:52:06,634 - [1] Series Editors | |
2025-02-28 13:52:06,634 - [1] Sifeng Liu, Institute of Grey Systems Studies, Nanjing University of Aeronautics | |
2025-02-28 13:52:06,634 - [1] and Astronautics, Nanjing, Jiangsu, China | |
2025-02-28 13:52:06,634 - [1] Yingjie Yang, Center for Computational Intelligence, De Montfort University, | |
2025-02-28 13:52:06,634 - [1] Leicester, UK | |
2025-02-28 13:52:06,634 - [1] Jeffrey Yi-Lin Forrest, Department of Mathematics, Slippery Rock University, PA, | |
2025-02-28 13:52:06,634 - [1] PA, USA | |
2025-02-28 13:52:06,634 - [1] Processed pages 2 to 2 of 9789819787272.pdf | |
2025-02-28 13:52:06,634 - [1] Processing pages 3 to 3 of 9789819787272.pdf | |
2025-02-28 13:52:06,643 - [1] Page 3 analysis: text_length=1315, image_ratio=0.00 | |
2025-02-28 13:52:06,643 - [1] Page 3 classified as text | |
2025-02-28 13:52:06,643 - [1] Page type determination took 0.01s | |
2025-02-28 13:52:06,643 - [1] Page 3 determined as text | |
2025-02-28 13:52:06,662 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_3_original.png | |
2025-02-28 13:52:06,662 - [1] PDF Debug Information: | |
2025-02-28 13:52:06,662 - [1] Raw font data: (4349, 'cff', 'Type1', 'FKWFSS+Times-Roman', 'F2', 'MacRomanEncoding') | |
2025-02-28 13:52:06,662 - [1] Raw font data: (4352, 'cff', 'Type1', 'WFKPEN+MTSYN', 'F3', 'WinAnsiEncoding') | |
2025-02-28 13:52:06,662 - [1] Number of fonts on page: 2 | |
2025-02-28 13:52:06,665 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:06,669 - [1] Successfully extracted 202 words from page 3 | |
2025-02-28 13:52:06,669 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 3) | |
2025-02-28 13:52:08,820 - [1] Successfully extracted structured markdown (1359 characters) | |
2025-02-28 13:52:08,821 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_3_structured.png.md | |
2025-02-28 13:52:08,821 - [1] Detected hierarchy: Title='None', Subtitle='None' | |
2025-02-28 13:52:08,821 - [1] Structured extraction for page 3: Title='None', Subtitle='None' | |
2025-02-28 13:52:08,822 - [1] Title page text to parse: | |
2025-02-28 13:52:08,822 - [1] This series aims to publish books on grey system and various applications in the | |
2025-02-28 13:52:08,822 - [1] fields of natural sciences, social sciences and engineering. | |
2025-02-28 13:52:08,822 - [1] This series is devoted to the international advancement of the theory and appli- | |
2025-02-28 13:52:08,822 - [1] cation of grey system. It seeks to foster professional exchanges between scientists | |
2025-02-28 13:52:08,822 - [1] and practitioners who are interested in the models, methods and applications of grey | |
2025-02-28 13:52:08,822 - [1] system. Through the pioneering work completed over 40 years, grey data analysis | |
2025-02-28 13:52:08,822 - [1] methods have become powerful tools in addressing system with poor information. | |
2025-02-28 13:52:08,822 - [1] Books published with this series will explore the models and applications of grey | |
2025-02-28 13:52:08,822 - [1] system, in order to tackle poor information more effectively and efficiently. The series | |
2025-02-28 13:52:08,822 - [1] aims to provide state-of-the-art information and case studies on new developments | |
2025-02-28 13:52:08,822 - [1] and trends in grey system research and its potential application to solve practical | |
2025-02-28 13:52:08,822 - [1] problems. | |
2025-02-28 13:52:08,822 - [1] Coverage includes, but is not limited to: | |
2025-02-28 13:52:08,822 - [1] • Foundations of grey systems theory | |
2025-02-28 13:52:08,822 - [1] • Grey sequence operators | |
2025-02-28 13:52:08,822 - [1] • Grey relational analysis models | |
2025-02-28 13:52:08,822 - [1] • Grey clustering evaluations models | |
2025-02-28 13:52:08,822 - [1] • Techniques for grey system forecasting | |
2025-02-28 13:52:08,822 - [1] • Grey models for decision-making | |
2025-02-28 13:52:08,822 - [1] • Combined grey models | |
2025-02-28 13:52:08,822 - [1] • Grey input-output models | |
2025-02-28 13:52:08,822 - [1] • Techniques for grey control | |
2025-02-28 13:52:08,822 - [1] • Various applications of grey system models in the fields of natural sciences, social | |
2025-02-28 13:52:08,822 - [1] sciences and engineering. | |
2025-02-28 13:52:08,822 - [1] Processed pages 3 to 3 of 9789819787272.pdf | |
2025-02-28 13:52:08,822 - [1] Processing pages 4 to 4 of 9789819787272.pdf | |
2025-02-28 13:52:08,828 - [1] Page 4 analysis: text_length=77, image_ratio=0.00 | |
2025-02-28 13:52:08,828 - [1] Page 4 classified as text | |
2025-02-28 13:52:08,828 - [1] Page type determination took 0.01s | |
2025-02-28 13:52:08,828 - [1] Page 4 determined as text | |
2025-02-28 13:52:08,841 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_4_original.png | |
2025-02-28 13:52:08,841 - [1] PDF Debug Information: | |
2025-02-28 13:52:08,841 - [1] Raw font data: (4349, 'cff', 'Type1', 'FKWFSS+Times-Roman', 'F2', 'MacRomanEncoding') | |
2025-02-28 13:52:08,841 - [1] Number of fonts on page: 1 | |
2025-02-28 13:52:08,842 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:08,842 - [1] Successfully extracted 11 words from page 4 | |
2025-02-28 13:52:08,843 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 4) | |
2025-02-28 13:52:10,996 - [1] Successfully extracted structured markdown (111 characters) | |
2025-02-28 13:52:10,996 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_4_structured.png.md | |
2025-02-28 13:52:10,996 - [1] Detected hierarchy: Title='None', Subtitle='None' | |
2025-02-28 13:52:10,996 - [1] Structured extraction for page 4: Title='None', Subtitle='None' | |
2025-02-28 13:52:10,997 - [1] Title page text to parse: | |
2025-02-28 13:52:10,997 - [1] Sifeng Liu | |
2025-02-28 13:52:10,997 - [1] Grey Systems Analysis | |
2025-02-28 13:52:10,997 - [1] Methods, Models and Applications | |
2025-02-28 13:52:10,997 - [1] Second Edition | |
2025-02-28 13:52:10,997 - [1] Processed pages 4 to 4 of 9789819787272.pdf | |
2025-02-28 13:52:10,997 - [1] Processing pages 5 to 5 of 9789819787272.pdf | |
2025-02-28 13:52:11,010 - [1] Page 5 analysis: text_length=3471, image_ratio=0.00 | |
2025-02-28 13:52:11,011 - [1] Page 5 classified as text | |
2025-02-28 13:52:11,011 - [1] Page type determination took 0.01s | |
2025-02-28 13:52:11,011 - [1] Page 5 determined as text | |
2025-02-28 13:52:11,037 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_5_original.png | |
2025-02-28 13:52:11,037 - [1] PDF Debug Information: | |
2025-02-28 13:52:11,038 - [1] Raw font data: (4348, 'cff', 'Type1', 'GOJRQX+Times-Bold', 'F1', 'WinAnsiEncoding') | |
2025-02-28 13:52:11,038 - [1] Raw font data: (4349, 'cff', 'Type1', 'FKWFSS+Times-Roman', 'F2', 'MacRomanEncoding') | |
2025-02-28 13:52:11,038 - [1] Number of fonts on page: 2 | |
2025-02-28 13:52:11,046 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:11,054 - [1] Successfully extracted 527 words from page 5 | |
2025-02-28 13:52:11,054 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 5) | |
2025-02-28 13:52:13,292 - [1] Successfully extracted structured markdown (3589 characters) | |
2025-02-28 13:52:13,292 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_5_structured.png.md | |
2025-02-28 13:52:13,293 - [1] Detected hierarchy: Title='Open Access This book is licensed under the terms of the Creative Commons Attribution-', Subtitle='NonCommercial-NoDerivatives 4.0 International License (http://creativecommons.org/licenses/by-ncnd/4.0/), which permits any noncommercial use, sharing, distribution and reproduction in any medium or' | |
2025-02-28 13:52:13,293 - [1] Structured extraction for page 5: Title='Open Access This book is licensed under the terms of the Creative Commons Attribution-', Subtitle='NonCommercial-NoDerivatives 4.0 International License (http://creativecommons.org/licenses/by-ncnd/4.0/), which permits any noncommercial use, sharing, distribution and reproduction in any medium or' | |
2025-02-28 13:52:13,293 - [1] Title page text to parse: | |
2025-02-28 13:52:13,293 - [1] Sifeng Liu | |
2025-02-28 13:52:13,293 - [1] Center for Grey Systems Studies | |
2025-02-28 13:52:13,293 - [1] Northwestern Polytechnical University | |
2025-02-28 13:52:13,293 - [1] Xi’an, China | |
2025-02-28 13:52:13,293 - [1] ISSN 2731-4936 | |
2025-02-28 13:52:13,293 - [1] ISSN 2731-4944 (electronic) | |
2025-02-28 13:52:13,293 - [1] Series on Grey System | |
2025-02-28 13:52:13,293 - [1] ISBN 978-981-97-8726-5 | |
2025-02-28 13:52:13,293 - [1] ISBN 978-981-97-8727-2 (eBook) | |
2025-02-28 13:52:13,293 - [1] https://doi.org/10.1007/978-981-97-8727-2 | |
2025-02-28 13:52:13,293 - [1] This work was made possible due to projects supported by the national major talent programme of China, | |
2025-02-28 13:52:13,293 - [1] the Marie Curie International Incoming Fellowship of the European Union, the National Natural Science | |
2025-02-28 13:52:13,293 - [1] Foundation of China, the Leverhulme Trust International Network, the joint projects supported by the | |
2025-02-28 13:52:13,293 - [1] NSFC and the RS in the UK, the Fundamental Research Funds for the Central Universities and the | |
2025-02-28 13:52:13,293 - [1] Publishing Fund of Excellence Academic Works of NPU. | |
2025-02-28 13:52:13,293 - [1] 1 st edition: © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature | |
2025-02-28 13:52:13,293 - [1] Singapore Pte Ltd. 2022 | |
2025-02-28 13:52:13,293 - [1] 2 nd edition: © The Editor(s) (if applicable) and The Author(s) 2025. This book is an open access | |
2025-02-28 13:52:13,293 - [1] publication. | |
2025-02-28 13:52:13,293 - [1] Open Access This book is licensed under the terms of the Creative Commons Attribution- | |
2025-02-28 13:52:13,293 - [1] NonCommercial-NoDerivatives 4.0 International License ( http://creativecommons.org/licenses/by-nc- | |
2025-02-28 13:52:13,293 - [1] nd/4.0/ ), which permits any noncommercial use, sharing, distribution and reproduction in any medium or | |
2025-02-28 13:52:13,293 - [1] format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the | |
2025-02-28 13:52:13,293 - [1] Creative Commons license and indicate if you modified the licensed material. You do not have permission | |
2025-02-28 13:52:13,293 - [1] under this license to share adapted material derived from this book or parts of it. | |
2025-02-28 13:52:13,293 - [1] The images or other third party material in this book are included in the book’s Creative Commons license, | |
2025-02-28 13:52:13,293 - [1] unless indicated otherwise in a credit line to the material. If material is not included in the book’s Creative | |
2025-02-28 13:52:13,293 - [1] Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted | |
2025-02-28 13:52:13,293 - [1] use, you will need to obtain permission directly from the copyright holder. | |
2025-02-28 13:52:13,293 - [1] This work is subject to copyright. All commercial rights are reserved by the author(s), whether the whole | |
2025-02-28 13:52:13,293 - [1] or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, | |
2025-02-28 13:52:13,293 - [1] recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or | |
2025-02-28 13:52:13,293 - [1] information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar | |
2025-02-28 13:52:13,293 - [1] methodology now known or hereafter developed. Regarding these commercial rights a non-exclusive | |
2025-02-28 13:52:13,293 - [1] license has been granted to the publisher. | |
2025-02-28 13:52:13,293 - [1] The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication | |
2025-02-28 13:52:13,293 - [1] does not imply, even in the absence of a specific statement, that such names are exempt from the relevant | |
2025-02-28 13:52:13,293 - [1] protective laws and regulations and therefore free for general use. | |
2025-02-28 13:52:13,293 - [1] The publisher, the authors and the editors are safe to assume that the advice and information in this book | |
2025-02-28 13:52:13,293 - [1] are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or | |
2025-02-28 13:52:13,293 - [1] the editors give a warranty, expressed or implied, with respect to the material contained herein or for any | |
2025-02-28 13:52:13,293 - [1] errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional | |
2025-02-28 13:52:13,293 - [1] claims in published maps and institutional affiliations. | |
2025-02-28 13:52:13,293 - [1] This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd. | |
2025-02-28 13:52:13,293 - [1] The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, | |
2025-02-28 13:52:13,293 - [1] Singapore | |
2025-02-28 13:52:13,293 - [1] If disposing of this product, please recycle the paper. | |
2025-02-28 13:52:13,293 - [1] Found title page data: {'title': 'Sifeng Liu\nCenter for Grey Systems Studies\nNorthwestern Polytechnical University\nXi’an, China\nISSN 2731-4936\nISSN 2731-4944 (electronic)\nSeries on Grey System\nISBN 978-981-97-8726-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2\nThis work was made possible due to projects supported', 'author': 'the national major talent programme of China,\nthe Marie Curie International Incoming Fellowship of the European Union, the National Natural Science\nFoundation of China, the Leverhulme Trust International Network, the joint projects supported'} | |
2025-02-28 13:52:13,293 - [1] Processed pages 5 to 5 of 9789819787272.pdf | |
2025-02-28 13:52:13,293 - [1] Processing pages 6 to 6 of 9789819787272.pdf | |
2025-02-28 13:52:13,304 - [1] Page 6 analysis: text_length=2036, image_ratio=0.00 | |
2025-02-28 13:52:13,304 - [1] Page 6 classified as text | |
2025-02-28 13:52:13,304 - [1] Page type determination took 0.01s | |
2025-02-28 13:52:13,305 - [1] Page 6 determined as text | |
2025-02-28 13:52:13,327 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_6_original.png | |
2025-02-28 13:52:13,327 - [1] PDF Debug Information: | |
2025-02-28 13:52:13,327 - [1] Raw font data: (4348, 'cff', 'Type1', 'GOJRQX+Times-Bold', 'F1', 'WinAnsiEncoding') | |
2025-02-28 13:52:13,327 - [1] Raw font data: (4349, 'cff', 'Type1', 'FKWFSS+Times-Roman', 'F2', 'MacRomanEncoding') | |
2025-02-28 13:52:13,327 - [1] Number of fonts on page: 2 | |
2025-02-28 13:52:13,332 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:13,337 - [1] Successfully extracted 329 words from page 6 | |
2025-02-28 13:52:13,337 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 6) | |
2025-02-28 13:52:15,495 - [1] Successfully extracted structured markdown (2082 characters) | |
2025-02-28 13:52:15,496 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_6_structured.png.md | |
2025-02-28 13:52:15,496 - [1] Detected hierarchy: Title='None', Subtitle='None' | |
2025-02-28 13:52:15,496 - [1] Structured extraction for page 6: Title='None', Subtitle='None' | |
2025-02-28 13:52:15,496 - [1] Title page text to parse: | |
2025-02-28 13:52:15,496 - [1] Series Preface | |
2025-02-28 13:52:15,496 - [1] This series will publish the books on grey system theory and various applications in | |
2025-02-28 13:52:15,496 - [1] the fields of natural sciences, social sciences and engineering. | |
2025-02-28 13:52:15,496 - [1] It is devoted to the international advancement of the theory and application of | |
2025-02-28 13:52:15,496 - [1] grey system theory, and seeks to foster professional exchanges between scientists | |
2025-02-28 13:52:15,496 - [1] and practitioners who are interested in the models, methods and applications of | |
2025-02-28 13:52:15,496 - [1] grey system theory. Through the pioneering work completed over 40 years, grey | |
2025-02-28 13:52:15,496 - [1] system analysis methods have become powerful tools in addressing system with | |
2025-02-28 13:52:15,496 - [1] poor information. | |
2025-02-28 13:52:15,496 - [1] Books published with this series will explore the models and applications of grey | |
2025-02-28 13:52:15,496 - [1] system theory, in order to tackle poor information more effectively and efficiently. | |
2025-02-28 13:52:15,496 - [1] The series aims to provide state-of-the-art information and case studies on new | |
2025-02-28 13:52:15,496 - [1] developments and trends in grey system research and its potential application to | |
2025-02-28 13:52:15,496 - [1] solve practical problems. | |
2025-02-28 13:52:15,496 - [1] In the era of big data, the grey system theory based on poor information data | |
2025-02-28 13:52:15,496 - [1] mining has sprung up. It has become an effective tool for people to extract valuable | |
2025-02-28 13:52:15,496 - [1] information from massive data. In the past 40 years, grey system method and model | |
2025-02-28 13:52:15,496 - [1] have been widely used in many fields, such as social science, natural science and | |
2025-02-28 13:52:15,496 - [1] engineering technology, which has led to innovation and progress in various fields. | |
2025-02-28 13:52:15,497 - [1] More and more people interested in grey system theory and a lot of new results have | |
2025-02-28 13:52:15,497 - [1] been obtained in recent years. In particular, successful applications in many fields | |
2025-02-28 13:52:15,497 - [1] have won the attention of the international world of learning. | |
2025-02-28 13:52:15,497 - [1] Scholars from more than 100 countries and regions in the world have published | |
2025-02-28 13:52:15,497 - [1] more than 300,000 documents of grey system research and applications. | |
2025-02-28 13:52:15,497 - [1] On the 7th of September, 2019, Angela Dorothea Merkel, then German Chan- | |
2025-02-28 13:52:15,497 - [1] cellor, praised grey system theory in her speech at Huazhong University of Science | |
2025-02-28 13:52:15,497 - [1] and Technology. She said that the work of Prof. Deng Julong, the founder of grey | |
2025-02-28 13:52:15,497 - [1] system theory and Prof. Liu Sifeng, the editor of this series, “have made a profound | |
2025-02-28 13:52:15,497 - [1] impact on the world.” | |
2025-02-28 13:52:15,497 - [1] v | |
2025-02-28 13:52:15,497 - [1] Processed pages 6 to 6 of 9789819787272.pdf | |
2025-02-28 13:52:15,497 - [1] Processing pages 7 to 7 of 9789819787272.pdf | |
2025-02-28 13:52:15,504 - [1] Page 7 analysis: text_length=827, image_ratio=0.00 | |
2025-02-28 13:52:15,505 - [1] Page 7 classified as text | |
2025-02-28 13:52:15,506 - [1] Page type determination took 0.01s | |
2025-02-28 13:52:15,506 - [1] Page 7 determined as text | |
2025-02-28 13:52:15,522 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_7_original.png | |
2025-02-28 13:52:15,522 - [1] PDF Debug Information: | |
2025-02-28 13:52:15,523 - [1] Raw font data: (4349, 'cff', 'Type1', 'FKWFSS+Times-Roman', 'F2', 'MacRomanEncoding') | |
2025-02-28 13:52:15,523 - [1] Raw font data: (4352, 'cff', 'Type1', 'WFKPEN+MTSYN', 'F3', 'WinAnsiEncoding') | |
2025-02-28 13:52:15,523 - [1] Number of fonts on page: 2 | |
2025-02-28 13:52:15,525 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:15,527 - [1] Successfully extracted 133 words from page 7 | |
2025-02-28 13:52:15,527 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 7) | |
2025-02-28 13:52:17,664 - [1] Successfully extracted structured markdown (897 characters) | |
2025-02-28 13:52:17,665 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_7_structured.png.md | |
2025-02-28 13:52:17,665 - [1] Detected hierarchy: Title='None', Subtitle='None' | |
2025-02-28 13:52:17,665 - [1] Structured extraction for page 7: Title='None', Subtitle='None' | |
2025-02-28 13:52:17,666 - [1] Title page text to parse: | |
2025-02-28 13:52:17,666 - [1] vi | |
2025-02-28 13:52:17,666 - [1] Series Preface | |
2025-02-28 13:52:17,666 - [1] The Coverage of this series includes, but is not limited to: | |
2025-02-28 13:52:17,666 - [1] • Foundations of grey systems theory | |
2025-02-28 13:52:17,666 - [1] • Grey sequence operators | |
2025-02-28 13:52:17,666 - [1] • Grey relational analysis models | |
2025-02-28 13:52:17,666 - [1] • Grey clustering evaluations models | |
2025-02-28 13:52:17,666 - [1] • Techniques for grey system forecasting | |
2025-02-28 13:52:17,666 - [1] • Grey models for decision-making | |
2025-02-28 13:52:17,666 - [1] • Combined grey models | |
2025-02-28 13:52:17,666 - [1] • Grey input-output models | |
2025-02-28 13:52:17,666 - [1] • Techniques for grey control | |
2025-02-28 13:52:17,666 - [1] • Various applications of grey system models in the fields of natural sciences, social | |
2025-02-28 13:52:17,666 - [1] sciences and engineering. | |
2025-02-28 13:52:17,666 - [1] If you are interested in the series on grey systems, please contact with | |
2025-02-28 13:52:17,666 - [1] Ms. Emily Zhang at [email protected] or Prof. Sifeng Liu at sfliu@ | |
2025-02-28 13:52:17,666 - [1] nwpu.edu.cn . | |
2025-02-28 13:52:17,666 - [1] Xi’an, China | |
2025-02-28 13:52:17,666 - [1] Prof. Sifeng Liu, Ph.D. | |
2025-02-28 13:52:17,666 - [1] Editor of the Book Series on Grey | |
2025-02-28 13:52:17,666 - [1] System, Director of Center for Grey | |
2025-02-28 13:52:17,666 - [1] Systems Studies, NPU, President | |
2025-02-28 13:52:17,666 - [1] of International Association of Grey | |
2025-02-28 13:52:17,666 - [1] System and Uncertain Analysis | |
2025-02-28 13:52:17,666 - [1] Processed pages 7 to 7 of 9789819787272.pdf | |
2025-02-28 13:52:17,666 - [1] Processing pages 8 to 8 of 9789819787272.pdf | |
2025-02-28 13:52:17,677 - [1] Page 8 analysis: text_length=2459, image_ratio=0.00 | |
2025-02-28 13:52:17,677 - [1] Page 8 classified as text | |
2025-02-28 13:52:17,678 - [1] Page type determination took 0.01s | |
2025-02-28 13:52:17,678 - [1] Page 8 determined as text | |
2025-02-28 13:52:17,700 - [1] Saved page image: logs/20250228_135149/9789819787272/9789819787272_page_8_original.png | |
2025-02-28 13:52:17,700 - [1] PDF Debug Information: | |
2025-02-28 13:52:17,701 - [1] Raw font data: (4348, 'cff', 'Type1', 'GOJRQX+Times-Bold', 'F1', 'WinAnsiEncoding') | |
2025-02-28 13:52:17,701 - [1] Raw font data: (4349, 'cff', 'Type1', 'FKWFSS+Times-Roman', 'F2', 'MacRomanEncoding') | |
2025-02-28 13:52:17,701 - [1] Number of fonts on page: 2 | |
2025-02-28 13:52:17,706 - [1] Successfully extracted text using dict method | |
2025-02-28 13:52:17,712 - [1] Successfully extracted 390 words from page 8 | |
2025-02-28 13:52:17,712 - [1] Extracting structured text with PyMuPDF4LLM from 9789819787272.pdf (page 8) | |
2025-02-28 13:52:19,957 - [1] Successfully extracted structured markdown (2501 characters) | |
2025-02-28 13:52:19,958 - [1] Saved structured markdown to: logs/20250228_135149/9789819787272/9789819787272_page_8_structured.png.md | |
2025-02-28 13:52:19,958 - [1] Detected hierarchy: Title='None', Subtitle='None' | |
2025-02-28 13:52:19,958 - [1] Structured extraction for page 8: Title='None', Subtitle='None' | |
2025-02-28 13:52:19,958 - [1] Title page text to parse: | |
2025-02-28 13:52:19,958 - [1] Preface | |
2025-02-28 13:52:19,958 - [1] In this book we answer the calls of the readers of our previous publications, and | |
2025-02-28 13:52:19,958 - [1] systematically present the main advances in grey system theory and applications. By | |
2025-02-28 13:52:19,958 - [1] following our readers’ feedback and suggestions, this volume introduces the most | |
2025-02-28 13:52:19,958 - [1] recent research results and updates on what is presented in our earlier books. In | |
2025-02-28 13:52:19,958 - [1] particular, the following content, which represents the author’s recent research, is | |
2025-02-28 13:52:19,958 - [1] highlighted in the book: general grey numbers and their operations, negative grey | |
2025-02-28 13:52:19,958 - [1] relational analysis models and grey relational analysis models based on similarity and | |
2025-02-28 13:52:19,958 - [1] closeness, three dimensional grey relational analysis models, grey clustering evalu- | |
2025-02-28 13:52:19,958 - [1] ation models based on mixed possibility functions, original difference grey model | |
2025-02-28 13:52:19,958 - [1] (ODGM), even difference grey model (EDGM), discrete grey model (DGM), frac- | |
2025-02-28 13:52:19,958 - [1] tional grey models, self-memory grey models, multi-attribute weighted intelligent | |
2025-02-28 13:52:19,958 - [1] grey target decision models, weight vector group with kernel and weighted compre- | |
2025-02-28 13:52:19,958 - [1] hensive clustering coefficient vector. We also attach a software designed for grey | |
2025-02-28 13:52:19,958 - [1] system modelling, which was developed by Bo Zeng using Visual C#, the widely | |
2025-02-28 13:52:19,958 - [1] employed C/S software tool. This user-friendly software allows users to conveniently | |
2025-02-28 13:52:19,958 - [1] input and/or upload data and clearly distinguish module functions. Also, the software | |
2025-02-28 13:52:19,958 - [1] has the ability to present users with operational details, as well as periodic and partial | |
2025-02-28 13:52:19,958 - [1] results. Additionally, users can adjust the levels of computational accuracy based on | |
2025-02-28 13:52:19,958 - [1] their practical needs. | |
2025-02-28 13:52:19,958 - [1] During the writing of this book, we prioritized theoretical simplicity and clarity to | |
2025-02-28 13:52:19,958 - [1] make it easy for the reader to follow the main arguments made. With a good number | |
2025-02-28 13:52:19,958 - [1] of practical applications, we intended to illustrate the methodology of grey system | |
2025-02-28 13:52:19,958 - [1] theory and modelling techniques so that we could emphasize the practical applica- | |
2025-02-28 13:52:19,958 - [1] bility of grey system thinking. We drew on the most recent research developments | |
2025-02-28 13:52:19,958 - [1] from various research groups around the world and tried to present the most complete | |
2025-02-28 13:52:19,958 - [1] picture of this new area of scientific endeavor in a concise manner. | |
2025-02-28 13:52:19,958 - [1] The overall planning and organization of topics contained in this book were carried | |
2025-02-28 13:52:19,958 - [1] out by Sifeng Liu, who also authored Chaps. 1 , 2 , 4 , 6 , 10 and 12 . Yingjie Yang | |
2025-02-28 13:52:19,958 - [1] produced Chaps. 3 , and 11 , Jeffrey Forrest composed Chaps. 7 and 8 , Naiming | |
2025-02-28 13:52:19,958 - [1] Xie wrote Chap. 9 , and the Appendix and the attached computer software were | |
2025-02-28 13:52:19,958 - [1] developed by Zeng Bo. Zhigeng Fang, Yaoguo Dang, Lirong Jian and Chunhua Su | |
2025-02-28 13:52:19,958 - [1] vii | |
2025-02-28 13:52:19,958 - [1] Found title page data: {'title': 'Preface\nIn this book we answer the calls of the readers of our previous publications, and\nsystematically present the main advances in grey system theory and applications.', 'author': 'following our readers’ feedback and suggestions, this volume introduces the most\nrecent research results and updates on what is presented in our earlier books. In\nparticular, the following content, which represents the author’s recent research, is\nhighlighted in the book: general grey numbers and their operations, negative grey\nrelational analysis models and grey relational analysis models based on similarity and\ncloseness, three dimensional grey relational analysis models, grey clustering evalu-\nation models based on mixed possibility functions, original difference grey model\n(ODGM), even difference grey model (EDGM), discrete grey model (DGM), frac-\ntional grey models, self-memory grey models, multi-attribute weighted intelligent\ngrey target decision models, weight vector group with kernel and weighted compre-\nhensive clustering coefficient vector. We also attach a software designed for grey\nsystem modelling, which was developed'} | |
2025-02-28 13:52:19,958 - [1] Processed pages 8 to 8 of 9789819787272.pdf | |
2025-02-28 13:52:19,959 - [1] Processing complete for 9789819787272.pdf | |
2025-02-28 13:52:19,959 - [1] Total pages processed: 8 | |
2025-02-28 13:52:19,960 - [1] Page types summary: | |
2025-02-28 13:52:19,960 - [1] - Text pages: 7 | |
2025-02-28 13:52:19,960 - [1] - Image pages: 0 | |
2025-02-28 13:52:19,960 - [1] - Mixed pages: 1 | |
2025-02-28 13:52:19,960 - [1] - Unknown pages: 0 | |
2025-02-28 13:52:19,960 - [1] - Error pages: 0 | |
2025-02-28 13:52:19,960 - [1] Processing time: 18.72s | |
2025-02-28 13:52:19,960 - [1] File size: 8.3MB | |
2025-02-28 13:52:19,960 - [1] Avg processing time per page: 2.34 seconds | |
2025-02-28 13:52:19,960 - [1] Skipping back matter processing: mode=never | |
2025-02-28 13:52:19,960 - [1] Combined structured data: Title='Grey Systems Analysis', Subtitle='Methods, Models and Applications' | |
2025-02-28 13:52:19,960 - [1] format_metadata input: | |
2025-02-28 13:52:19,960 - [1] metadata: {'format': 'PDF 1.4', 'producer': 'Springer-i', 'creationDate': "D:20241226162248+05'30'", 'modDate': "D:20241226200022+05'30'"} | |
2025-02-28 13:52:19,960 - [1] field_sources: None | |
2025-02-28 13:52:19,960 - [1] Processing key 'creationDate': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Processing key 'format': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Processing key 'modDate': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Processing key 'producer': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Raw metadata from PDF: | |
2025-02-28 13:52:19,960 - [1] [1] creationDate: D:20241226162248+05'30' | |
2025-02-28 13:52:19,960 - [1] [1] format: PDF 1.4 | |
2025-02-28 13:52:19,960 - [1] [1] modDate: D:20241226200022+05'30' | |
2025-02-28 13:52:19,960 - [1] [1] producer: Springer-i | |
2025-02-28 13:52:19,960 - [1] format_metadata input: | |
2025-02-28 13:52:19,960 - [1] metadata: {'format': 'PDF 1.4', 'producer': 'Springer-i', 'creationDate': "D:20241226162248+05'30'", 'modDate': "D:20241226200022+05'30'"} | |
2025-02-28 13:52:19,960 - [1] field_sources: None | |
2025-02-28 13:52:19,960 - [1] Processing key 'creationDate': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Processing key 'format': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Processing key 'modDate': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Processing key 'producer': | |
2025-02-28 13:52:19,960 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:19,960 - [1] Cleaned metadata before AI: | |
2025-02-28 13:52:19,960 - [1] [1] creationDate: D:20241226162248+05'30' | |
2025-02-28 13:52:19,960 - [1] [1] format: PDF 1.4 | |
2025-02-28 13:52:19,960 - [1] [1] modDate: D:20241226200022+05'30' | |
2025-02-28 13:52:19,960 - [1] [1] producer: Springer-i | |
2025-02-28 13:52:19,963 - [1] Extracted important identifiers from full OCR text: {'isbn': [{'value': 'ISBN 978-981-97-8726-5', 'context': 'ISSN 2731-4944 (electronic)\nSeries on Grey System\nISBN 978-981-97-8726-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10'}, {'value': 'ISBN 978-981-97-8727-2', 'context': 'nic)\nSeries on Grey System\nISBN 978-981-97-8726-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2'}, {'value': '978-981-97-8727-2', 'context': '978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2\nThis work was made possible due to projects suppo'}], 'doi': [{'value': '10.1007/978-981-97-8727-2', 'context': '-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2\nThis work was made possible due to projects suppo'}]} | |
2025-02-28 13:52:20,127 - [1] Requesting metadata consolidation from AI | |
2025-02-28 13:52:20,127 - [1] RAW AI REQUEST: | |
2025-02-28 13:52:20,127 - [1] [ | |
2025-02-28 13:52:20,127 - [1] { | |
2025-02-28 13:52:20,127 - [1] "role": "system", | |
2025-02-28 13:52:20,127 - [1] "content": "\n You are a world-renowned librarian, \n information scientist, researcher, and data scientist. \n Previously at the Library of Congress. You are now \n a cutting edge AI data scientist. You are contributing \n to open source libraries to make information free. \n You're contracted with us to assist us in generating\n appropriate filenames and metadata based on a careful review of the \n provided materials, including metadata and OCR text.\nYou must return ONLY valid JSON matching the provided schema.\nNever invent or guess values - only use information found in the sources.\nMake sure to properly close all JSON arrays with ] and objects with }.\nThe response must be a complete, valid JSON object.\n\n - Watch out and ignore \"related books from this publisher\" or \"you might also like\" sections\n \n" | |
2025-02-28 13:52:20,127 - [1] }, | |
2025-02-28 13:52:20,127 - [1] { | |
2025-02-28 13:52:20,127 - [1] "role": "user", | |
2025-02-28 13:52:20,127 - [1] "content": "\n INPUT:\n Project Goal: Extract, analyze, and standardize metadata + rename files for a collection of published books and papers.\n \n You are a world-renowned librarian, \n information scientist, researcher, and data scientist. \n Previously at the Library of Congress. You are now \n a cutting edge AI data scientist. You are contributing \n to open source libraries to make information free. \n You're contracted with us to assist us in generating\n appropriate filenames and metadata based on a careful review of the \n provided materials, including metadata and OCR text.\n Here are 3 required and one optional data points, in order of priority, aka, sources:\n - original filename: 9789819787272.pdf\n - original extracted metadata: {\n \"format\": \"PDF 1.4\",\n \"producer\": \"Springer-i\",\n \"creationDate\": \"D:20241226162248+05'30'\",\n \"modDate\": \"D:20241226200022+05'30'\"\n}\n - extracted text from traditional OCR methods of:\n - the first 8 frontmatter pages\n - the last 5 backmatter pages (optional): Series on Grey System\nSifeng Liu\nGrey Systems \nAnalysis\nMethods, Models and Applications\nSecond\u00a0Edition\nSeries on Grey System\n\nSifeng Liu\n\n; Grey Systems\nAnalysis\n\nMethods, Models and Applications\nSecond Edition\n\ng) Springer\n\n\n\nSeries on Grey System\nSeries Editors\nSifeng Liu, Institute of Grey Systems Studies, Nanjing University of Aeronautics\nand Astronautics, Nanjing, Jiangsu, China\nYingjie Yang, Center for Computational Intelligence, De Montfort University,\nLeicester, UK\nJeffrey Yi-Lin Forrest, Department of Mathematics, Slippery Rock University, PA,\nPA, USA\n\nThis series aims to publish books on grey system and various applications in the\n\ufb01elds of natural sciences, social sciences and engineering.\nThis series is devoted to the international advancement of the theory and appli-\ncation of grey system. It seeks to foster professional exchanges between scientists\nand practitioners who are interested in the models, methods and applications of grey\nsystem. Through the pioneering work completed over 40 years, grey data analysis\nmethods have become powerful tools in addressing system with poor information.\nBooks published with this series will explore the models and applications of grey\nsystem, in order to tackle poor information more effectively and ef\ufb01ciently. The series\naims to provide state-of-the-art information and case studies on new developments\nand trends in grey system research and its potential application to solve practical\nproblems.\nCoverage includes, but is not limited to:\n\u2022 Foundations of grey systems theory\n\u2022 Grey sequence operators\n\u2022 Grey relational analysis models\n\u2022 Grey clustering evaluations models\n\u2022 Techniques for grey system forecasting\n\u2022 Grey models for decision-making\n\u2022 Combined grey models\n\u2022 Grey input-output models\n\u2022 Techniques for grey control\n\u2022 Various applications of grey system models in the \ufb01elds of natural sciences, social\nsciences and engineering.\n\nSifeng Liu\nGrey Systems Analysis\nMethods, Models and Applications\nSecond Edition\n\nSifeng Liu\nCenter for Grey Systems Studies\nNorthwestern Polytechnical University\nXi\u2019an, China\nISSN 2731-4936\nISSN 2731-4944 (electronic)\nSeries on Grey System\nISBN 978-981-97-8726-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2\nThis work was made possible due to projects supported by the national major talent programme of China,\nthe Marie Curie International Incoming Fellowship of the European Union, the National Natural Science\nFoundation of China, the Leverhulme Trust International Network, the joint projects supported by the\nNSFC and the RS in the UK, the Fundamental Research Funds for the Central Universities and the\nPublishing Fund of Excellence Academic Works of NPU.\n1 st edition: \u00a9 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature\nSingapore Pte Ltd. 2022\n2 nd edition: \u00a9 The Editor(s) (if applicable) and The Author(s) 2025. This book is an open access\npublication.\nOpen Access This book is licensed under the terms of the Creative Commons Attribution-\nNonCommercial-NoDerivatives 4.0 International License ( http://creativecommons.org/licenses/by-nc-\nnd/4.0/ ), which permits any noncommercial use, sharing, distribution and reproduction in any medium or\nformat, as long as you give appropriate credit to the original author(s) and the source, provide a link to the\nCreative Commons license and indicate if you modi\ufb01ed the licensed material. You do not have permission\nunder this license to share adapted material derived from this book or parts of it.\nThe images or other third party material in this book are included in the book\u2019s Creative Commons license,\nunless indicated otherwise in a credit line to the material. If material is not included in the book\u2019s Creative\nCommons license and your intended use is not permitted by statutory regulation or exceeds the permitted\nuse, you will need to obtain permission directly from the copyright holder.\nThis work is subject to copyright. All commercial rights are reserved by the author(s), whether the whole\nor part of the material is concerned, speci\ufb01cally the rights of translation, reprinting, reuse of illustrations,\nrecitation, broadcasting, reproduction on micro\ufb01lms or in any other physical way, and transmission or\ninformation storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar\nmethodology now known or hereafter developed. Regarding these commercial rights a non-exclusive\nlicense has been granted to the publisher.\nThe use of general descriptive names, registered names, trademarks, service marks, etc. in this publication\ndoes not imply, even in the absence of a speci\ufb01c statement, that such names are exempt from the relevant\nprotective laws and regulations and therefore free for general use.\nThe publisher, the authors and the editors are safe to assume that the advice and information in this book\nare believed to be true and accurate at the date of publication. Neither the publisher nor the authors or\nthe editors give a warranty, expressed or implied, with respect to the material contained herein or for any\nerrors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional\nclaims in published maps and institutional af\ufb01liations.\nThis Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.\nThe registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721,\nSingapore\nIf disposing of this product, please recycle the paper.\n\nSeries Preface\nThis series will publish the books on grey system theory and various applications in\nthe \ufb01elds of natural sciences, social sciences and engineering.\nIt is devoted to the international advancement of the theory and application of\ngrey system theory, and seeks to foster professional exchanges between scientists\nand practitioners who are interested in the models, methods and applications of\ngrey system theory. Through the pioneering work completed over 40 years, grey\nsystem analysis methods have become powerful tools in addressing system with\npoor information.\nBooks published with this series will explore the models and applications of grey\nsystem theory, in order to tackle poor information more effectively and ef\ufb01ciently.\nThe series aims to provide state-of-the-art information and case studies on new\ndevelopments and trends in grey system research and its potential application to\nsolve practical problems.\nIn the era of big data, the grey system theory based on poor information data\nmining has sprung up. It has become an effective tool for people to extract valuable\ninformation from massive data. In the past 40 years, grey system method and model\nhave been widely used in many \ufb01elds, such as social science, natural science and\nengineering technology, which has led to innovation and progress in various \ufb01elds.\nMore and more people interested in grey system theory and a lot of new results have\nbeen obtained in recent years. In particular, \n\n[...text truncated...]\n\n--- ISBN Context ---\nISSN 2731-4944 (electronic)\nSeries on Grey System\nISBN 978-981-97-8726-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10\n\n--- ISBN Context ---\nnic)\nSeries on Grey System\nISBN 978-981-97-8726-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2\n\n--- ISBN Context ---\n978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2\nThis work was made possible due to projects suppo\n\n--- DOI Context ---\n-5\nISBN 978-981-97-8727-2 (eBook)\nhttps://doi.org/10.1007/978-981-97-8727-2\nThis work was made possible due to projects suppo \n - detailed image title page interpretation from VLM model (optional): Vision analysis unavailable\n - structured text data (optional): {\n \"markdown\": \"# Grey Systems Analysis\\n\\n## Methods, Models and Applications\\n\\n Second Edition\\n\\n\\n-----\\n\\n\",\n \"hierarchy\": {\n \"title\": \"Grey Systems Analysis\",\n \"subtitle\": \"Methods, Models and Applications\"\n },\n \"page_info\": {}\n}\n\n Analyze all of the above and make determinations for all metadata fields.\n NEVER invent or guess values.\n Pay special attention to extracting ALL possible fields from the original filename.\n\n When analyzing structured text data (if provided):\n - The data is in markdown format with preserved document hierarchy based on font sizes and formatting\n - Header levels (# H1, ## H2, etc.) indicate importance and hierarchy in the document\n - The largest headers typically represent titles, followed by subtitles and other hierarchical elements\n - Use this hierarchical information to accurately identify titles, subtitles, and other metadata\n\n For any field where you:\n - Cannot find a value\n - Are unsure about the value\n - Find only partial or ambiguous information\n Return null for that field. Do not return \"Unknown\" or empty strings.\n\n General rules:\n \n - Watch out and ignore \"related books from this publisher\" or \"you might also like\" sections\n \n Our goal is Title Case, for Title, Subtitle, Author, except for words that are: \n - Acronyms (e.g., ISBN, DOI, USA, NATO)\n - Roman numerals (e.g., III, XIV)\n - Compound words with internal caps (e.g., MacBook, iPhone, LaTex)\n\n Field-specific rules:\n \n Title and Subtitle Rules:\n - First, identify all text elements as they appear in the source\n - Then, if any element contains edition information:\n - Move it to the edition field in standardized format\n - REMOVE it completely from its original location (title or subtitle)\n - If this leaves an empty subtitle, set subtitle to null\n - The final result should NEVER contain edition information in both the edition \n field and title/subtitle\n\n Lastly, aside from the edition instruction,keep the title and subtitle \n as they appear in the source.\n\n TEXT QUALITY ASSESSMENT:\n - Before applying structural rules, evaluate the overall quality of the extracted text:\n 1. Determine if the text appears to be accurately captured or contains significant OCR errors\n 2. Check for garbled text, random characters, or text that appears to be from incorrectly OCR'd images\n 3. Assess if the text forms logical, coherent content typical of book front matter\n \n - If text quality is good:\n - Pay special attention to newlines as meaningful structural delimiters\n - Text components separated by newlines often represent hierarchical elements (title, subtitle, author)\n - When a potential title is followed by a newline and then additional text, consider the additional text as a likely subtitle\n \n - If text quality is poor:\n - Rely less on newline structure and more on content patterns\n - Look for recognizable title/subtitle patterns despite formatting issues\n - Prioritize semantic understanding over structural cues\n \n - Always take a balanced approach:\n - Use newlines as a guide, not an absolute rule\n - Consider the overall context\n - When in doubt about a delimiter, evaluate if the content division makes logical sense\n\n Title:\n - Pay special attention to subtitles that look \n like they are part of the title.\n\n Subtitle:\n - Look for common subtitle indicators:\n - If subtitle spans multiple lines, combine them\n - Colon (:) followed by text, most likely a subtitle is after the colon\n - Text after em dash (\u2014) or en dash (\u2013)\n - Explanatory text beginning with common phrases:\n - \"A Study of...\"\n - \"Being a...\"\n - \"An Account of...\"\n - \"A History of...\"\n - \"Proceedings of...\"\n\n SPECIAL CASES:\n - For biographical works:\n - If a person's name appears prominently at the start, it's likely the title\n - Look for a descriptive subtitle that explains the biography\n - Don't rely on words like \"By\" to determine authorship\n - For conference proceedings, use the conference name as subtitle\n\n \n - Author[s] Rules:\n - Include ALL authors when > 1\n - As first name, last name\n - in the order they appear\n - Comma separate\n - Discard \"and\" or \"et al.\"\n\n \n Publisher Rules:\n - Remove corporate designations, like \\s+(LLC|Inc\\.?|Ltd\\.?|Limited|Corporation|Corp\\.?|Co\\.?)(?=[\\s,.]|$)\n - Remove \"The\" \n - Keep case as found in source\n - Remove unnecessary filler words\n - Publisher names should match their original capitalization\n\n \n - Edition Rules:\n - Omit first editions entirely. Keep 2nd and up.\n - Format as ordinal number + \"Ed.\" (e.g. \"2nd Ed.\", \"3rd Ed.\")\n - Key rules: \n - When you find edition info in title/subtitle:\n 1. Move it to this field in standardized format\n 2. Remove it COMPLETELY from title/subtitle\n 3. NEVER duplicate edition information\n - For multiple editions, use most recent\n - Remove filler words (new, revised, printing, etc.)\n - Disregard any vague edition info, like \"Updated\", \"Revised\", \"New\", etc.\n\n Examples:\n Input: \"Programming in C++ Third Edition\"\n Correct:\n title: \"Programming in C++\"\n edition: \"3rd Ed.\"\n \n Input: \"Advanced Physics, Second Edition Updated\"\n Correct:\n title: \"Advanced Physics\"\n edition: \"2nd Ed.\"\n\n - Pattern checks:\n - First editions (to omit): ['(?i)(?:^|[^\\\\w])(1st|first|1)\\\\s*(?:ed|edn|edition|printing)s?\\\\b']\n - Valid editions (2nd and up): ['(?i)(?:^|[^\\\\w]|edition|ed\\\\.?:)\\\\s*([2-9]|[1-9][0-9]+)(?:nd|rd|th)\\\\s*(?:ed|edn|edition|printing)s?\\\\b', '(?i)(?:second|third|fourth|fifth|sixth|seventh|eighth|ninth|tenth)\\\\s+(?:ed|edn|edition|printing)s?\\\\b']\n\n \n - Year Rules:\n - Use latest year, if you find multiple editions, or a range\n - always express as 4 digits\n - If no year found, omit field entirely\n\n \n - Identifiers Rules:\n - For ISBN:\n - Books may have 0, 1, or multiple ISBNs, each with a different medium (hardcover, paperback, digital)\n - Do not use ISBNs that belong to other books\n - Identifier Format: \n - ISBN-10: 10 chars (last can be 'X')\n - ISBN-13: 13 digits (starts 978/979)\n - Medium types:\n Hardcover/Hardback:\n - \"Hardcover\", \"Hard cover\", \"Hardback\", \"Hard back\"\n - \"HBK\", \"HC\", \"HB\", \"Cloth\"\n - \"(hbk)\", \"(hc)\", \"(cloth)\"\n \n Paperback/Softcover:\n - \"Paperback\", \"Paper back\", \"Softcover\", \"Soft cover\"\n - \"PBK\", \"SC\", \"PB\", \"TPB\"\n - \"(pbk)\", \"(sc)\", \"(pb)\"\n \n Electronic/Digital:\n - \"eBook\", \"e-Book\", \"Electronic\", \"Digital\"\n - \"EBK\", \"E-text\"\n - \"(ebk)\", \"(e-book)\", \"(digital)\"\n \n - Patterns indicating other books, and thus to ignore:\n - Sources:\n - \"titles\", \"books\", \"papers\", \"series\", \"works\", \"publications\"\n - Qualifiers:\n - \"other\",\n - \"also\"\n - \"more\", \"more from\",\n - \"similar\",\n - \"related\",\n - \"additional\"\n - \"you might also like\", \"you might like\", \"you may be interested in\"\n - Example combinations of sources and qualifiers:\n \"You might also like other books by [author]\"\n \"Similar books by [publisher]\"\n \"More from [author]\"\n \"Related works by [author]\"\n \"Additional titles by [publisher]\"\n\n \n - DOI/Digital Object Identifier Rules:\n - Format: Must start with \"10.\"\n - Common formats:\n - Standard: 10.XXXX/XXXXX\n - URL format: https://doi.org/10.XXXX/XXXXX\n - With prefix: DOI: 10.XXXX/XXXXX\n - Look for DOIs in:\n - Copyright pages\n - Headers/footers\n - References sections\n\n \n - For LOC/Library of Congress Control Number:\n - Format: \"YYYY[XXXXXX]\"\n\n" | |
2025-02-28 13:52:20,127 - [1] } | |
2025-02-28 13:52:20,127 - [1] ] | |
2025-02-28 13:52:27,904 - [1] RAW AI RESPONSE: | |
2025-02-28 13:52:27,904 - [1] { | |
2025-02-28 13:52:27,905 - [1] "decisions": { | |
2025-02-28 13:52:27,905 - [1] "title": { | |
2025-02-28 13:52:27,905 - [1] "value": "Grey Systems Analysis", | |
2025-02-28 13:52:27,905 - [1] "confidence": "high", | |
2025-02-28 13:52:27,905 - [1] "sources": [ | |
2025-02-28 13:52:27,905 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,905 - [1] ] | |
2025-02-28 13:52:27,905 - [1] }, | |
2025-02-28 13:52:27,905 - [1] "author": { | |
2025-02-28 13:52:27,905 - [1] "value": "Sifeng Liu", | |
2025-02-28 13:52:27,905 - [1] "confidence": "high", | |
2025-02-28 13:52:27,905 - [1] "sources": [ | |
2025-02-28 13:52:27,905 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,905 - [1] ] | |
2025-02-28 13:52:27,905 - [1] }, | |
2025-02-28 13:52:27,905 - [1] "publisher": { | |
2025-02-28 13:52:27,905 - [1] "value": "Springer", | |
2025-02-28 13:52:27,905 - [1] "confidence": "high", | |
2025-02-28 13:52:27,905 - [1] "sources": [ | |
2025-02-28 13:52:27,905 - [1] "original extracted metadata" | |
2025-02-28 13:52:27,905 - [1] ] | |
2025-02-28 13:52:27,905 - [1] }, | |
2025-02-28 13:52:27,905 - [1] "year": { | |
2025-02-28 13:52:27,905 - [1] "value": "2025", | |
2025-02-28 13:52:27,905 - [1] "confidence": "high", | |
2025-02-28 13:52:27,905 - [1] "sources": [ | |
2025-02-28 13:52:27,905 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,905 - [1] ] | |
2025-02-28 13:52:27,905 - [1] }, | |
2025-02-28 13:52:27,905 - [1] "subtitle": { | |
2025-02-28 13:52:27,905 - [1] "value": "Methods, Models and Applications", | |
2025-02-28 13:52:27,905 - [1] "confidence": "high", | |
2025-02-28 13:52:27,905 - [1] "sources": [ | |
2025-02-28 13:52:27,905 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,905 - [1] ] | |
2025-02-28 13:52:27,905 - [1] }, | |
2025-02-28 13:52:27,905 - [1] "edition": { | |
2025-02-28 13:52:27,905 - [1] "value": "2nd Ed.", | |
2025-02-28 13:52:27,905 - [1] "confidence": "high", | |
2025-02-28 13:52:27,905 - [1] "sources": [ | |
2025-02-28 13:52:27,905 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,905 - [1] ] | |
2025-02-28 13:52:27,905 - [1] }, | |
2025-02-28 13:52:27,905 - [1] "isbn": [ | |
2025-02-28 13:52:27,905 - [1] { | |
2025-02-28 13:52:27,905 - [1] "value": "978-981-97-8727-2", | |
2025-02-28 13:52:27,906 - [1] "medium": "eBook", | |
2025-02-28 13:52:27,906 - [1] "confidence": "high", | |
2025-02-28 13:52:27,906 - [1] "sources": [ | |
2025-02-28 13:52:27,906 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,906 - [1] ] | |
2025-02-28 13:52:27,906 - [1] }, | |
2025-02-28 13:52:27,906 - [1] { | |
2025-02-28 13:52:27,906 - [1] "value": "978-981-97-8726-5", | |
2025-02-28 13:52:27,906 - [1] "medium": "Hardcover", | |
2025-02-28 13:52:27,906 - [1] "confidence": "high", | |
2025-02-28 13:52:27,906 - [1] "sources": [ | |
2025-02-28 13:52:27,906 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,906 - [1] ] | |
2025-02-28 13:52:27,906 - [1] } | |
2025-02-28 13:52:27,906 - [1] ], | |
2025-02-28 13:52:27,906 - [1] "doi": { | |
2025-02-28 13:52:27,906 - [1] "value": "10.1007/978-981-97-8727-2", | |
2025-02-28 13:52:27,906 - [1] "confidence": "high", | |
2025-02-28 13:52:27,906 - [1] "sources": [ | |
2025-02-28 13:52:27,906 - [1] "extracted text from traditional OCR methods" | |
2025-02-28 13:52:27,906 - [1] ] | |
2025-02-28 13:52:27,906 - [1] }, | |
2025-02-28 13:52:27,906 - [1] "loc": { | |
2025-02-28 13:52:27,906 - [1] "value": "null", | |
2025-02-28 13:52:27,906 - [1] "confidence": "low", | |
2025-02-28 13:52:27,906 - [1] "sources": [] | |
2025-02-28 13:52:27,906 - [1] } | |
2025-02-28 13:52:27,906 - [1] }, | |
2025-02-28 13:52:27,906 - [1] "reasoning": { | |
2025-02-28 13:52:27,906 - [1] "title": "The title was clearly identified from the OCR text as 'Grey Systems Analysis'. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "author": "The author was identified as 'Sifeng Liu' from the OCR text. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "publisher": "The publisher was identified as 'Springer' from the original extracted metadata. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "year": "The year '2025' was identified from the OCR text, indicating the publication date of the second edition. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "subtitle": "The subtitle 'Methods, Models and Applications' was clearly identified from the OCR text. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "edition": "The edition was identified as '2nd Ed.' from the OCR text, which was explicitly stated. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "isbn": "Two ISBNs were identified: '978-981-97-8727-2' for the eBook and '978-981-97-8726-5' for the hardcover edition, both from the OCR text. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "doi": "The DOI '10.1007/978-981-97-8727-2' was clearly identified from the OCR text. No conflicts were found.", | |
2025-02-28 13:52:27,906 - [1] "loc": "No Library of Congress Control Number was found in the provided materials." | |
2025-02-28 13:52:27,906 - [1] } | |
2025-02-28 13:52:27,906 - [1] } | |
2025-02-28 13:52:27,907 - [1] format_metadata input: | |
2025-02-28 13:52:27,907 - [1] metadata: {'decisions': {'title': {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'author': {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'publisher': {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']}, 'year': {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'subtitle': {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'edition': {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'isbn': [{'value': '978-981-97-8727-2', 'medium': 'eBook', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '978-981-97-8726-5', 'medium': 'Hardcover', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'loc': {'value': 'null', 'confidence': 'low', 'sources': []}}, 'reasoning': {'title': "The title was clearly identified from the OCR text as 'Grey Systems Analysis'. No conflicts were found.", 'author': "The author was identified as 'Sifeng Liu' from the OCR text. No conflicts were found.", 'publisher': "The publisher was identified as 'Springer' from the original extracted metadata. No conflicts were found.", 'year': "The year '2025' was identified from the OCR text, indicating the publication date of the second edition. No conflicts were found.", 'subtitle': "The subtitle 'Methods, Models and Applications' was clearly identified from the OCR text. No conflicts were found.", 'edition': "The edition was identified as '2nd Ed.' from the OCR text, which was explicitly stated. No conflicts were found.", 'isbn': "Two ISBNs were identified: '978-981-97-8727-2' for the eBook and '978-981-97-8726-5' for the hardcover edition, both from the OCR text. No conflicts were found.", 'doi': "The DOI '10.1007/978-981-97-8727-2' was clearly identified from the OCR text. No conflicts were found.", 'loc': 'No Library of Congress Control Number was found in the provided materials.'}} | |
2025-02-28 13:52:27,907 - [1] field_sources: None | |
2025-02-28 13:52:27,907 - [1] Processing key 'author': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,907 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Processing key 'doi': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,907 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Processing key 'edition': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,907 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Processing key 'isbn': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: [{'value': '978-981-97-8727-2', 'medium': 'eBook', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '978-981-97-8726-5', 'medium': 'Hardcover', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}] | |
2025-02-28 13:52:27,907 - [1] Processing key 'loc': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': 'null', 'confidence': 'low', 'sources': []} | |
2025-02-28 13:52:27,907 - [1] decision_sources: [] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Processing key 'publisher': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']} | |
2025-02-28 13:52:27,907 - [1] decision_sources: ['original extracted metadata'] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Processing key 'subtitle': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,907 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Processing key 'title': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,907 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Processing key 'year': | |
2025-02-28 13:52:27,907 - [1] source: (<class 'str'>) | |
2025-02-28 13:52:27,907 - [1] decision_data: {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,907 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:27,907 - [1] Raw response from AI: | |
2025-02-28 13:52:27,907 - [1] [1] author: Sifeng Liu :: extracted text from traditional OCR methods | |
2025-02-28 13:52:27,907 - [1] [1] doi: 10.1007/978-981-97-8727-2 :: extracted text from traditional OCR methods | |
2025-02-28 13:52:27,907 - [1] [1] edition: 2nd Ed. :: extracted text from traditional OCR methods | |
2025-02-28 13:52:27,907 - [1] [1] isbn: [{'value': '978-981-97-8727-2', 'medium': 'eBook', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '978-981-97-8726-5', 'medium': 'Hardcover', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}] | |
2025-02-28 13:52:27,907 - [1] [1] loc: null | |
2025-02-28 13:52:27,907 - [1] [1] publisher: Springer :: original extracted metadata | |
2025-02-28 13:52:27,907 - [1] [1] subtitle: Methods, Models and Applications :: extracted text from traditional OCR methods | |
2025-02-28 13:52:27,907 - [1] [1] title: Grey Systems Analysis :: extracted text from traditional OCR methods | |
2025-02-28 13:52:27,907 - [1] [1] year: 2025 :: extracted text from traditional OCR methods | |
2025-02-28 13:52:27,907 - [1] Cost calculation: 3909i + 721o tokens = $0.0010 | |
2025-02-28 13:52:27,908 - [1] Processing decisions from AI response: <class 'dict'> | |
2025-02-28 13:52:27,908 - [1] Processing field 'title': type=<class 'dict'>, value={'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,908 - [1] Field 'title' value: type=<class 'str'>, value=Grey Systems Analysis | |
2025-02-28 13:52:27,908 - [1] Processing field 'author': type=<class 'dict'>, value={'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,908 - [1] Field 'author' value: type=<class 'str'>, value=Sifeng Liu | |
2025-02-28 13:52:27,908 - [1] Processing field 'publisher': type=<class 'dict'>, value={'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']} | |
2025-02-28 13:52:27,908 - [1] Field 'publisher' value: type=<class 'str'>, value=Springer | |
2025-02-28 13:52:27,908 - [1] Processing field 'year': type=<class 'dict'>, value={'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,908 - [1] Field 'year' value: type=<class 'str'>, value=2025 | |
2025-02-28 13:52:27,908 - [1] Processing field 'subtitle': type=<class 'dict'>, value={'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,908 - [1] Field 'subtitle' value: type=<class 'str'>, value=Methods, Models and Applications | |
2025-02-28 13:52:27,908 - [1] Processing field 'edition': type=<class 'dict'>, value={'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,908 - [1] Field 'edition' value: type=<class 'str'>, value=2nd Ed. | |
2025-02-28 13:52:27,908 - [1] Processing field 'isbn': type=<class 'list'>, value=[{'value': '978-981-97-8727-2', 'medium': 'eBook', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '978-981-97-8726-5', 'medium': 'Hardcover', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}] | |
2025-02-28 13:52:27,908 - [1] Processing field 'doi': type=<class 'dict'>, value={'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,908 - [1] Field 'doi' value: type=<class 'str'>, value=10.1007/978-981-97-8727-2 | |
2025-02-28 13:52:27,908 - [1] Processing field 'loc': type=<class 'dict'>, value={'value': 'null', 'confidence': 'low', 'sources': []} | |
2025-02-28 13:52:27,908 - [1] Field 'loc' value: type=<class 'str'>, value=null | |
2025-02-28 13:52:27,908 - [1] About to process metadata response. parsed type=<class 'dict'> | |
2025-02-28 13:52:27,908 - [1] Processing decisions: type=<class 'dict'>, value={'title': {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'author': {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'publisher': {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']}, 'year': {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'subtitle': {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'edition': {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'isbn': [{'value': '978-981-97-8727-2', 'medium': 'eBook', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '978-981-97-8726-5', 'medium': 'Hardcover', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'loc': {'value': 'null', 'confidence': 'low', 'sources': []}} | |
2025-02-28 13:52:27,908 - [1] Checking required field 'author' | |
2025-02-28 13:52:27,909 - [1] Field 'author' data: type=<class 'dict'>, value={'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,909 - [1] Checking required field 'title' | |
2025-02-28 13:52:27,909 - [1] Field 'title' data: type=<class 'dict'>, value={'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:27,909 - [1] Processing optional field 'publisher': type=<class 'dict'> | |
2025-02-28 13:52:27,909 - [1] Processing optional field 'year': type=<class 'dict'> | |
2025-02-28 13:52:27,909 - [1] Processing optional field 'subtitle': type=<class 'dict'> | |
2025-02-28 13:52:27,909 - [1] Processing optional field 'edition': type=<class 'dict'> | |
2025-02-28 13:52:27,909 - [1] Processing optional field 'isbn': type=<class 'list'> | |
2025-02-28 13:52:27,909 - [1] Processing optional field 'doi': type=<class 'dict'> | |
2025-02-28 13:52:27,909 - [1] Processing optional field 'loc': type=<class 'dict'> | |
2025-02-28 13:52:27,910 - [1] About to clean metadata. processed_metadata type=<class 'dict'> | |
2025-02-28 13:52:27,910 - [1] Formatting authors: Sifeng Liu | |
2025-02-28 13:52:27,911 - [1] Metadata sources used: | |
2025-02-28 13:52:27,911 - [1] title: extracted text from traditional OCR methods (confidence: high) | |
2025-02-28 13:52:27,911 - [1] author: extracted text from traditional OCR methods (confidence: high) | |
2025-02-28 13:52:27,911 - [1] publisher: original extracted metadata (confidence: high) | |
2025-02-28 13:52:27,911 - [1] year: extracted text from traditional OCR methods (confidence: high) | |
2025-02-28 13:52:27,911 - [1] subtitle: extracted text from traditional OCR methods (confidence: high) | |
2025-02-28 13:52:27,911 - [1] edition: extracted text from traditional OCR methods (confidence: high) | |
2025-02-28 13:52:27,911 - [1] doi: extracted text from traditional OCR methods (confidence: high) | |
2025-02-28 13:52:27,911 - [1] loc: (confidence: low) | |
2025-02-28 13:52:27,911 - [1] Tokens: I: 3909 O: 721 | |
2025-02-28 13:52:27,911 - [1] Cost: $0.0010 | |
2025-02-28 13:52:27,912 - [1] Starting filename generation with metadata: | |
2025-02-28 13:52:27,912 - [1] Raw metadata type: <class 'dict'> | |
2025-02-28 13:52:27,912 - [1] Raw metadata: {'decisions': {'author': {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'title': {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'publisher': {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']}, 'year': {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'subtitle': {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'edition': {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'isbn': [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}}, 'reasoning': {'title': "The title was clearly identified from the OCR text as 'Grey Systems Analysis'. No conflicts were found.", 'author': "The author was identified as 'Sifeng Liu' from the OCR text. No conflicts were found.", 'publisher': "The publisher was identified as 'Springer' from the original extracted metadata. No conflicts were found.", 'year': "The year '2025' was identified from the OCR text, indicating the publication date of the second edition. No conflicts were found.", 'subtitle': "The subtitle 'Methods, Models and Applications' was clearly identified from the OCR text. No conflicts were found.", 'edition': "The edition was identified as '2nd Ed.' from the OCR text, which was explicitly stated. No conflicts were found.", 'isbn': "Two ISBNs were identified: '978-981-97-8727-2' for the eBook and '978-981-97-8726-5' for the hardcover edition, both from the OCR text. No conflicts were found.", 'doi': "The DOI '10.1007/978-981-97-8727-2' was clearly identified from the OCR text. No conflicts were found.", 'loc': 'No Library of Congress Control Number was found in the provided materials.'}, 'author': 'Sifeng Liu', 'title': 'Grey Systems Analysis', 'publisher': 'Springer', 'year': '2025', 'subtitle': 'Methods, Models and Applications', 'edition': '2nd Ed.', 'isbn': [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': '10.1007/978-981-97-8727-2'} | |
2025-02-28 13:52:27,912 - [1] Converting metadata values to strings: | |
2025-02-28 13:52:27,912 - [1] decisions: type=<class 'dict'>, value={'author': {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'title': {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'publisher': {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']}, 'year': {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'subtitle': {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'edition': {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'isbn': [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}} | |
2025-02-28 13:52:27,912 - [1] reasoning: type=<class 'dict'>, value={'title': "The title was clearly identified from the OCR text as 'Grey Systems Analysis'. No conflicts were found.", 'author': "The author was identified as 'Sifeng Liu' from the OCR text. No conflicts were found.", 'publisher': "The publisher was identified as 'Springer' from the original extracted metadata. No conflicts were found.", 'year': "The year '2025' was identified from the OCR text, indicating the publication date of the second edition. No conflicts were found.", 'subtitle': "The subtitle 'Methods, Models and Applications' was clearly identified from the OCR text. No conflicts were found.", 'edition': "The edition was identified as '2nd Ed.' from the OCR text, which was explicitly stated. No conflicts were found.", 'isbn': "Two ISBNs were identified: '978-981-97-8727-2' for the eBook and '978-981-97-8726-5' for the hardcover edition, both from the OCR text. No conflicts were found.", 'doi': "The DOI '10.1007/978-981-97-8727-2' was clearly identified from the OCR text. No conflicts were found.", 'loc': 'No Library of Congress Control Number was found in the provided materials.'} | |
2025-02-28 13:52:27,912 - [1] author: type=<class 'str'>, value=Sifeng Liu | |
2025-02-28 13:52:27,912 - [1] title: type=<class 'str'>, value=Grey Systems Analysis | |
2025-02-28 13:52:27,912 - [1] publisher: type=<class 'str'>, value=Springer | |
2025-02-28 13:52:27,912 - [1] year: type=<class 'str'>, value=2025 | |
2025-02-28 13:52:27,912 - [1] subtitle: type=<class 'str'>, value=Methods, Models and Applications | |
2025-02-28 13:52:27,912 - [1] edition: type=<class 'str'>, value=2nd Ed. | |
2025-02-28 13:52:27,912 - [1] isbn: type=<class 'list'>, value=[{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}] | |
2025-02-28 13:52:27,912 - [1] doi: type=<class 'str'>, value=10.1007/978-981-97-8727-2 | |
2025-02-28 13:52:27,912 - [1] decisions: type=<class 'str'>, value={'author': {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'title': {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'publisher': {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']}, 'year': {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'subtitle': {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'edition': {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'isbn': [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}} | |
2025-02-28 13:52:27,912 - [1] reasoning: type=<class 'str'>, value={'title': "The title was clearly identified from the OCR text as 'Grey Systems Analysis'. No conflicts were found.", 'author': "The author was identified as 'Sifeng Liu' from the OCR text. No conflicts were found.", 'publisher': "The publisher was identified as 'Springer' from the original extracted metadata. No conflicts were found.", 'year': "The year '2025' was identified from the OCR text, indicating the publication date of the second edition. No conflicts were found.", 'subtitle': "The subtitle 'Methods, Models and Applications' was clearly identified from the OCR text. No conflicts were found.", 'edition': "The edition was identified as '2nd Ed.' from the OCR text, which was explicitly stated. No conflicts were found.", 'isbn': "Two ISBNs were identified: '978-981-97-8727-2' for the eBook and '978-981-97-8726-5' for the hardcover edition, both from the OCR text. No conflicts were found.", 'doi': "The DOI '10.1007/978-981-97-8727-2' was clearly identified from the OCR text. No conflicts were found.", 'loc': 'No Library of Congress Control Number was found in the provided materials.'} | |
2025-02-28 13:52:27,912 - [1] author: type=<class 'str'>, value=Sifeng Liu | |
2025-02-28 13:52:27,912 - [1] title: type=<class 'str'>, value=Grey Systems Analysis | |
2025-02-28 13:52:27,912 - [1] publisher: type=<class 'str'>, value=Springer | |
2025-02-28 13:52:27,912 - [1] year: type=<class 'str'>, value=2025 | |
2025-02-28 13:52:27,912 - [1] subtitle: type=<class 'str'>, value=Methods, Models and Applications | |
2025-02-28 13:52:27,912 - [1] edition: type=<class 'str'>, value=2nd Ed. | |
2025-02-28 13:52:27,912 - [1] isbn: type=<class 'str'>, value=[{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}] | |
2025-02-28 13:52:27,912 - [1] doi: type=<class 'str'>, value=10.1007/978-981-97-8727-2 | |
2025-02-28 13:52:27,916 - [1] Starting filename generation with gpt-4o-mini | |
2025-02-28 13:52:27,916 - [1] Using OpenAI GPT model for filename generation | |
2025-02-28 13:52:28,608 - [1] Cleaning up filename | |
2025-02-28 13:52:28,609 - [1] Processing complete for 9789819787272.pdf | |
2025-02-28 13:52:28,609 - [1] Total pages in document: 419 | |
2025-02-28 13:52:28,609 - [1] Pages processed for OCR/text: 8 | |
2025-02-28 13:52:28,609 - [1] Validation status: All pages validated | |
2025-02-28 13:52:28,609 - [1] Page types summary: | |
2025-02-28 13:52:28,609 - [1] - Text pages: 7 | |
2025-02-28 13:52:28,609 - [1] - Image pages: 0 | |
2025-02-28 13:52:28,609 - [1] - Unknown pages: 0 | |
2025-02-28 13:52:28,609 - [1] - Error pages: 0 | |
2025-02-28 13:52:28,609 - [1] 0m 27s | |
2025-02-28 13:52:28,609 - [1] File size: 8.3MB | |
2025-02-28 13:52:28,609 - [1] Tokens: Input: 3909, Output: 721 | |
2025-02-28 13:52:28,653 - [1] Extracted 0 annotations from ./data/9789819787272.pdf | |
2025-02-28 13:52:28,653 - [1] Has annotations: no | |
2025-02-28 13:52:28,666 - [1] File CRC32: 74140e38 | |
2025-02-28 13:52:28,666 - [1] Metadata CRC32: 70682d14 | |
2025-02-28 13:52:28,666 - [1] Current: 9789819787272.pdf | |
2025-02-28 13:52:28,666 - [1] Proposed: Grey Systems Analysis, Methods, Models and Applications, (Sifeng Liu), Springer, (2nd Ed.), (2025).pdf | |
2025-02-28 13:52:28,666 - [1] format_metadata input: | |
2025-02-28 13:52:28,666 - [1] metadata: {'decisions': {'author': {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'title': {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'publisher': {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']}, 'year': {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'subtitle': {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'edition': {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, 'isbn': [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}}, 'reasoning': {'title': "The title was clearly identified from the OCR text as 'Grey Systems Analysis'. No conflicts were found.", 'author': "The author was identified as 'Sifeng Liu' from the OCR text. No conflicts were found.", 'publisher': "The publisher was identified as 'Springer' from the original extracted metadata. No conflicts were found.", 'year': "The year '2025' was identified from the OCR text, indicating the publication date of the second edition. No conflicts were found.", 'subtitle': "The subtitle 'Methods, Models and Applications' was clearly identified from the OCR text. No conflicts were found.", 'edition': "The edition was identified as '2nd Ed.' from the OCR text, which was explicitly stated. No conflicts were found.", 'isbn': "Two ISBNs were identified: '978-981-97-8727-2' for the eBook and '978-981-97-8726-5' for the hardcover edition, both from the OCR text. No conflicts were found.", 'doi': "The DOI '10.1007/978-981-97-8727-2' was clearly identified from the OCR text. No conflicts were found.", 'loc': 'No Library of Congress Control Number was found in the provided materials.'}, 'author': 'Sifeng Liu', 'title': 'Grey Systems Analysis', 'publisher': 'Springer', 'year': '2025', 'subtitle': 'Methods, Models and Applications', 'edition': '2nd Ed.', 'isbn': [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}], 'doi': '10.1007/978-981-97-8727-2'} | |
2025-02-28 13:52:28,666 - [1] field_sources: {'author': ['extracted text from traditional OCR methods'], 'title': ['extracted text from traditional OCR methods'], 'publisher': ['original extracted metadata'], 'year': ['extracted text from traditional OCR methods'], 'subtitle': ['extracted text from traditional OCR methods'], 'edition': ['extracted text from traditional OCR methods'], 'isbn': ['extracted text from traditional OCR methods'], 'doi': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:28,666 - [1] Processing key 'author': | |
2025-02-28 13:52:28,666 - [1] source: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,666 - [1] decision_data: {'value': 'Sifeng Liu', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:28,667 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] Processing key 'doi': | |
2025-02-28 13:52:28,667 - [1] source: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] decision_data: {'value': '10.1007/978-981-97-8727-2', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:28,667 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] Processing key 'edition': | |
2025-02-28 13:52:28,667 - [1] source: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] decision_data: {'value': '2nd Ed.', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:28,667 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] Processing key 'isbn': | |
2025-02-28 13:52:28,667 - [1] source: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] decision_data: [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}] | |
2025-02-28 13:52:28,667 - [1] Processing key 'publisher': | |
2025-02-28 13:52:28,667 - [1] source: ['original extracted metadata'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] decision_data: {'value': 'Springer', 'confidence': 'high', 'sources': ['original extracted metadata']} | |
2025-02-28 13:52:28,667 - [1] decision_sources: ['original extracted metadata'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] Processing key 'subtitle': | |
2025-02-28 13:52:28,667 - [1] source: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] decision_data: {'value': 'Methods, Models and Applications', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:28,667 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] Processing key 'title': | |
2025-02-28 13:52:28,667 - [1] source: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] decision_data: {'value': 'Grey Systems Analysis', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:28,667 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] Processing key 'year': | |
2025-02-28 13:52:28,667 - [1] source: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] decision_data: {'value': '2025', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']} | |
2025-02-28 13:52:28,667 - [1] decision_sources: ['extracted text from traditional OCR methods'] (<class 'list'>) | |
2025-02-28 13:52:28,667 - [1] author: Sifeng Liu :: extracted text from traditional OCR methods | |
2025-02-28 13:52:28,667 - [1] doi: 10.1007/978-981-97-8727-2 :: extracted text from traditional OCR methods | |
2025-02-28 13:52:28,667 - [1] edition: 2nd Ed. :: extracted text from traditional OCR methods | |
2025-02-28 13:52:28,667 - [1] isbn: [{'value': '9789819787272', 'medium': 'ebk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}, {'value': '9789819787265', 'medium': 'hbk', 'confidence': 'high', 'sources': ['extracted text from traditional OCR methods']}] :: extracted text from traditional OCR methods | |
2025-02-28 13:52:28,667 - [1] publisher: Springer :: original extracted metadata | |
2025-02-28 13:52:28,667 - [1] subtitle: Methods, Models and Applications :: extracted text from traditional OCR methods | |
2025-02-28 13:52:28,667 - [1] title: Grey Systems Analysis :: extracted text from traditional OCR methods | |
2025-02-28 13:52:28,667 - [1] year: 2025 :: extracted text from traditional OCR methods | |
2025-02-28 13:52:28,667 - [1] 8.3mb First 8 of 419pg {'text': 7, 'image': 0, 'mixed': 1, 'unknown': 0, 'error': 0} | |
2025-02-28 13:52:28,667 - [1] Tokens: I: 3909 O: 721 | |
2025-02-28 13:52:28,667 - [1] $0.001019 | |
2025-02-28 13:52:28,667 - [1] 0m 27s | |
2025-02-28 13:52:28,667 - [1] Reasoning: | |
2025-02-28 13:52:28,667 - [1] title: The title was clearly identified from the OCR text as 'Grey Systems Analysis'. No conflicts were found. | |
2025-02-28 13:52:28,667 - [1] author: The author was identified as 'Sifeng Liu' from the OCR text. No conflicts were found. | |
2025-02-28 13:52:28,667 - [1] publisher: The publisher was identified as 'Springer' from the original extracted metadata. No conflicts were found. | |
2025-02-28 13:52:28,667 - [1] year: The year '2025' was identified from the OCR text, indicating the publication date of the second edition. No conflicts were found. | |
2025-02-28 13:52:28,667 - [1] subtitle: The subtitle 'Methods, Models and Applications' was clearly identified from the OCR text. No conflicts were found. | |
2025-02-28 13:52:28,668 - [1] edition: The edition was identified as '2nd Ed.' from the OCR text, which was explicitly stated. No conflicts were found. | |
2025-02-28 13:52:28,668 - [1] isbn: Two ISBNs were identified: '978-981-97-8727-2' for the eBook and '978-981-97-8726-5' for the hardcover edition, both from the OCR text. No conflicts were found. | |
2025-02-28 13:52:28,668 - [1] doi: The DOI '10.1007/978-981-97-8727-2' was clearly identified from the OCR text. No conflicts were found. | |
2025-02-28 13:52:28,668 - [1] loc: No Library of Congress Control Number was found in the provided materials. | |
2025-02-28 13:52:28,668 - [1] `````````````````````````````````` | |
2025-02-28 13:52:28,668 - Summary: | |
2025-02-28 13:52:28,668 - Valid: 1 | |
2025-02-28 13:52:28,668 - Invalid: 0 | |
2025-02-28 13:52:28,668 - Successful: 1 | |
2025-02-28 13:52:28,668 - Failed: 0 | |
2025-02-28 13:52:28,668 - With Annotations: 0 | |
2025-02-28 13:52:28,668 - Files with API Timeouts: 0 | |
2025-02-28 13:52:28,668 - Total: | |
2025-02-28 13:52:28,668 - 0m 27s | |
2025-02-28 13:52:28,668 - Tokens: Input: 3909, Output: 721 | |
2025-02-28 13:52:28,668 - $0.001019 | |
2025-02-28 13:52:28,668 - Average per PDF: | |
2025-02-28 13:52:28,668 - 0m 27s | |
2025-02-28 13:52:28,668 - Tokens: Input: 3909.00, Output: 721.00 | |
2025-02-28 13:52:28,668 - $0.001019 | |
2025-02-28 13:52:28,678 - `````````````````````````````````` |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment