웹 기술의 급속한 발전은 기업들이 관리해야 하는 정보량의 폭발적인 증가를 초래하였다. 이와 더불어 보다 정확한 정보를 찾기 위한 검색 엔진 솔루션 시장의 규모도 더불어 크게 증가하였다. 하지만 대부분의 검색엔진들은 사용자의 검색 의도를 고려하지 않고 사용자가 입력한 특정 키워드를 포함하는 문서들을 반환하는 방법을 채택하고 있어, 실제 사용자가 원하는 정보를 찾는데 까지는 부가적인 시간과 노력이 요구된다. 본 연구에서는 이러한 문제를 해결 하기 위한 중요 기술인 적합성을 만족시키기 위해 재현율과 정확율을 높일 수 있는 방법을 제안하였다. 우선 검색어의 재현율을 높일 수 있도록 유사어 관계 확장을 위한 온톨로지 스키마 모델을 제안하고 이를 기반으로 한 추론을 통해 검색어의 확장을 제시하였다. 확장된 검색어들을 이용하여 문서 검색을 하기 위한 다단계 유사도 검색 순위화 알고리즘을 제안하였다. 설계된 온톨로지 스키마와 온톨로지 저장소의 데이터를 기반으로 추론과 유사도 검색 순위화 엔진이 포함된 웹사이트 형식의 사용자 의도 적응형 검색 솔루션을 구현하였다. 구현된 검색 솔루션을 통해 다양한 검색어를 입력하여 제안 방법의 타당성을 입증하였고 사용자 의도를 고려한 맞춤형 검색 솔루션의 필요성을 설명하였다.
Recently, the rapid progress of a number of standardized web technologies and the proliferation of web users in the world bring an explosive increase of producing and consuming information documents on the web. In addition, most companies have produced, shared, and managed a huge number of information documents that are needed to perform their businesses. They also have discretionally raked, stored and managed a number of web documents published on the web for their business. Along with this increase of information documents that should be managed in the companies, the need of a solution to locate information documents more accurately among a huge number of information sources have increased. In order to satisfy the need of accurate search, the market size of search engine solution market is becoming increasingly expended. The most important functionality among much functionality provided by search engine is to locate accurate information documents from a huge information sources. The major metric to evaluate the accuracy of search engine is relevance that consists of two measures, precision and recall. Precision is thought of as a measure of exactness, that is, what percentage of information considered as true answer are actually such, whereas recall is a measure of completeness, that is, what percentage of true answer are retrieved as such. These two measures can be used differently according to the applied domain. If we need to exhaustively search information such as patent documents and research papers, it is better to increase the recall. On the other hand, when the amount of information is small scale, it is better to increase precision. Most of existing web search engines typically uses a keyword search method that returns web documents including keywords which correspond to search words entered by a user. This method has a virtue of locating all web documents quickly, even though many search words are inputted. However, this method has a fundamental imitation of not considering search intention of a user, thereby retrieving irrelevant results as well as relevant ones. Thus, it takes additional time and effort to set relevant ones out from all results returned by a search engine. That is, keyword search method can increase recall, while it is difficult to locate web documents which a user actually want to find because it does not provide a means of understanding the intention of a user and reflecting it to a progress of searching information. Thus, this research suggests a new method of combining ontology-based search solution with core search functionalities provided by existing search engine solutions. The method enables a search engine to provide optimal search results by inferenceing the search intention of a user. To that end, we build an ontology which contains concepts and relationships among them in a specific domain. The ontology is used to inference synonyms of a set of search keywords inputted by a user, thereby making the search intention of the user reflected into the progress of searching information more actively compared to existing search engines. Based on the proposed method we implement a prototype search system and test the system in the patent domain where we experiment on searching relevant documents associated with a patent. The experiment shows that our system increases the both recall and precision in accuracy and augments the search productivity by using improved user interface that enables a user to interact with our search system effectively. In the future research, we will study a means of validating the better performance of our prototype system by comparing other search engine solution and will extend the applied domain into other domains for searching information such as portal.