In this paper, we propose an agent-based text mining algorithm to extract the potential context of papers published on the WWW. A user provides the agent with keywords and assigns a threshold value to each keyword; the agent in turn attempts to find papers that match the keywords within the defined thresholds. To achieve context recognition, the algorithm mines the keywords and identifies the potential context by analysing a paper’s abstract. The mining process entails data cleaning, formatting, filtering, and identifying the candidate keywords. Subsequently, based on the strength of each keyword and its threshold value, the algorithm identifies the paper’s potential context.
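The pipeline described above (cleaning, filtering, scoring keywords, and comparing against per-keyword thresholds) can be illustrated with a minimal sketch. The abstract does not define how keyword strength is computed, so this sketch assumes strength is the share of filtered tokens equal to the keyword. The stop-word list and the function names `clean_and_filter`, `keyword_strengths`, and `potential_context` are hypothetical and are not taken from the paper. Only single-word keywords are handled.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the paper does not specify its filtering rules.
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "for", "is", "on", "we", "from"}


def clean_and_filter(abstract: str) -> list[str]:
    """Lower-case the text, keep alphabetic tokens, and drop stop words."""
    tokens = re.findall(r"[a-z]+", abstract.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def keyword_strengths(tokens: list[str], keywords: list[str]) -> dict[str, float]:
    """Assumed strength measure: share of filtered tokens equal to the keyword."""
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on an empty abstract
    return {k: counts[k.lower()] / total for k in keywords}


def potential_context(abstract: str, thresholds: dict[str, float]) -> list[str]:
    """Return the keywords whose strength meets the user-assigned threshold."""
    tokens = clean_and_filter(abstract)
    strengths = keyword_strengths(tokens, list(thresholds))
    return [k for k, s in strengths.items() if s >= thresholds[k]]


if __name__ == "__main__":
    sample = "We propose an agent for text mining. The agent mines keywords from text."
    # Each keyword carries its own user-assigned threshold.
    print(potential_context(sample, {"agent": 0.2, "mining": 0.2}))
```

In this sketch a keyword contributes to the paper's potential context only when its strength reaches its own threshold, which mirrors the per-keyword thresholds the user assigns to the agent.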
All Science Journal Classification (ASJC) codes
- Artificial Intelligence