Improving Recommendation System Performance through Analysis of Users' Issues of Interest

최성이; 현윤진; 김남규

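Document type: Academic journal

Authors: 최성이 (Kookmin University), 현윤진 (Kookmin University), 김남규 (Kookmin University)

Journal: 한국지능정보시스템학회 (Korea Intelligent Information Systems Society), 지능정보연구 (Journal of Intelligence and Information Systems), Vol. 21, No. 3

Published: 2015.9

Pages: 101 - 116 (16 pages)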

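Research history of this paper (2)

2016

Improving Recommendation System Performance through Analysis of Users' Issues of Interest

최성이, Business IT, 2016.01, thesis

2015

Improving Recommendation System Performance through Analysis of Users' Issues of Interest

최성이, 현윤진, 김남규, 지능정보연구, 2015.09, academic journal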

Abstract · Keywords

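Many organizations have long made data-driven decisions, and structured data such as numerical records have been widely used for this purpose. Recently, however, the spread of smart devices and social media has led to vast amounts of information in diverse forms being generated, shared, and stored, and attention is shifting from traditional decision-making based on structured data to decision-making based on unstructured big data. In recommendation systems, a representative area of data-driven decision-making, the need to use unstructured data to improve performance has also been raised steadily. Because a user's tendencies and preferences are directly tied to customer needs, there is a pressing need to identify those tendencies through unstructured data analysis and thereby improve the accuracy of product recommendation and purchase prediction. This study therefore proposes a way to ultimately improve recommendation system performance by measuring user tendencies and raising the accuracy of repurchase prediction, in particular per-category repurchase prediction. Specifically, we analyze users' everyday internet usage records to identify the issues in the news articles that customers view, quantify each customer's interest in various issues, and then propose a model that uses these measures to predict whether the customer will repurchase in each category. In experiments on internet news viewing records and shopping mall purchase records derived from real web transactions, the proposed model, which uses both past purchase history and issues of interest, was somewhat more accurate than a category repurchase prediction model that uses past purchase history alone.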

Recently, owing to the development of smart devices and social media, vast amounts of information in various forms have accumulated. In particular, considerable research effort is being directed toward analyzing unstructured big data to resolve various social problems. Accordingly, the focus of data-driven decision-making is moving from structured data analysis to unstructured data analysis. In the field of recommendation systems, a typical area of data-driven decision-making, the need to use unstructured data to improve system performance has also steadily increased. Approaches to improving the performance of recommendation systems fall into two categories: improving algorithms and acquiring useful, high-quality data. Traditionally, most efforts have taken the former approach, while the latter has attracted relatively little attention. In this sense, efforts to utilize unstructured data from various sources are timely and necessary. In particular, because users' interests are directly connected with their needs, identifying those interests through unstructured big data analysis can be a clue for improving the performance of recommendation systems. This study therefore proposes a methodology for improving recommendation systems by measuring users' interests. Specifically, it proposes a method to quantify users' interests by analyzing their internet usage patterns and to predict their repurchases based on the discovered preferences.
There are two important modules in this study. The first module extracts users' purchase history and, by analyzing it, predicts the repurchase probability of each category. We include the first module in our research scope in order to compare the accuracy of the traditional purchase-based prediction model with that of our new model presented in the second module. The core of our methodology is the second module, which extracts users' interests by analyzing the news articles they have read. The second module first constructs a correspondence matrix between topics and news articles by performing topic modeling on real-world news articles. It then analyzes users' news access patterns and constructs a correspondence matrix between articles and users. By merging these two results, we obtain a correspondence matrix between users and topics, which describes users' interests in a structured manner. Finally, using this matrix, the second module builds a model for predicting the repurchase probability of each category.
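The merging step in the second module can be illustrated with a small sketch. The abstract does not say how the two correspondence matrices are merged, so the matrix product and the row normalization below are assumptions made for illustration, and all numbers are hypothetical toy values rather than data from the study.

```python
# Hypothetical toy data; none of these numbers come from the paper.
# article_topic[a][t]: weight of topic t in article a (e.g. from topic modeling).
article_topic = [
    [0.9, 0.1, 0.0],  # article 0
    [0.2, 0.7, 0.1],  # article 1
    [0.0, 0.3, 0.7],  # article 2
    [0.5, 0.5, 0.0],  # article 3
]

# user_article[u][a]: number of times user u viewed article a.
user_article = [
    [2, 0, 0, 1],  # user 0
    [0, 1, 3, 0],  # user 1
]


def matmul(left, right):
    """Multiply two dense matrices given as lists of rows."""
    inner = len(right)
    assert all(len(row) == inner for row in left), "inner dimensions must match"
    cols = len(right[0])
    return [
        [sum(row[k] * right[k][j] for k in range(inner)) for j in range(cols)]
        for row in left
    ]


def normalize_rows(matrix):
    """Scale each row to sum to 1 so that heavy and light readers are comparable."""
    result = []
    for row in matrix:
        total = sum(row)
        result.append([value / total if total else 0.0 for value in row])
    return result


# Assumed merge: (users x articles) times (articles x topics) = (users x topics).
user_topic = normalize_rows(matmul(user_article, article_topic))

for user_id, interests in enumerate(user_topic):
    print(user_id, [round(value, 3) for value in interests])
```

Each row of `user_topic` can then serve as a structured interest profile and be appended to purchase-history features when a per-category repurchase model is trained.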
In this paper, we also provide experimental results of our performance evaluation. The data used in our experiments are as follows. We acquired web transaction data of 5,000 panel members from a company that specializes in analyzing the rankings of internet sites. We first extracted from the original data 15,000 URLs of news articles published from July 2012 to June 2013 and crawled the main content of those articles. We then selected 2,615 users who had read at least one of the extracted news articles. Among the 2,615 users, 359 had purchased at least one item from our target shopping mall 'G'. In the experiments, we analyzed the purchase history and news access records of these 359 internet users. The performance evaluation showed that our prediction model, which uses both users' interests and purchase history, outperforms a prediction model that uses only purchase history in terms of misclassification ratio. In detail, our model outperformed the traditional one in the appliance, beauty, computer, culture, digital, fashion, and sports categories when artificial neural network based models were used. Similarly, our model outperformed the traditional one in the beauty, computer, digital, fashion, food, and furniture categories when decision tree based models were used, although the improvement was very small.
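The comparison criterion named above, the misclassification ratio, is the share of cases in which the predicted repurchase label differs from the actual one. A minimal sketch follows; the function name and the label vectors are hypothetical and do not reproduce any result from the experiments.

```python
def misclassification_rate(actual, predicted):
    """Return the fraction of predictions that differ from the actual labels."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("label lists must be non-empty and of equal length")
    errors = sum(1 for a, p in zip(actual, predicted) if a != p)
    return errors / len(actual)


# Hypothetical labels for one category: 1 = repurchased, 0 = did not repurchase.
actual = [1, 0, 1, 1, 0, 0, 1, 0]
purchase_only = [1, 0, 0, 1, 1, 0, 0, 0]           # baseline model output
purchase_plus_interest = [1, 0, 1, 1, 1, 0, 0, 0]  # proposed model output

baseline_rate = misclassification_rate(actual, purchase_only)
proposed_rate = misclassification_rate(actual, purchase_plus_interest)

# A lower rate means fewer misclassified users in this category.
print(baseline_rate, proposed_rate)
```

Repeating this calculation per category and per model family (neural network, decision tree) would give the kind of category-level comparison the abstract reports.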

#Big Data Analysis #Data Mining #Recommendation Systems #Text Mining #Topic Analysis #Topic Modeling

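Author information

최성이

Affiliation: Kookmin University

Main research areas: Engineering > Industrial Engineering; Social Sciences > Business Administration

Papers: 3, Usage count: 1,040

현윤진

Affiliation: Graduate School of Business IT, Kookmin University

Main research areas: Social Sciences > Business Administration; Engineering > Industrial Engineering

Papers: 11, Usage count: 4,069

김남규

Affiliation: Kookmin University

Main research areas: Engineering > Industrial Engineering (TOP 1%); Engineering > Electrical and Electronic Engineering > Information and Communication Engineering

Papers: 96, Usage count: 23,372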

UCI(KEPA) : I410-ECN-0101-2016-003-001985446
