Academia.eduAcademia.edu

Studying Data Mining and Data Warehousing with Different E-Learning System

2013, International Journal of Advanced Computer Science and Applications

(IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 4, No.1, 2013 Studying Data Mining and Data Warehousing with Different E-Learning System Dr. Mohamed F. AlAjmi, Shakir Khan Dr. Arun Sharma PhD Head of Quality and E-Leaning units King Saud University Riyadh, Saudi Arabia M.Sc (Computer Science) Head,Department of Computer Science Krishna Institute of Engineering and Technology, Ghaziabad-201206, INDIA Researcher at King Saud University, Riyadh Saudi Arabia Nationality Indian Abstract----Data Mining and Data Warehousing are two most significant techniques for pattern detection and concentrated data management in present technology. ELearning is one of the most important applications of data mining. The foremost idea is to provide a proposal for a practical model and architecture. The standards and system structural design are analyzed here. This paper provides importance to the combination of Web Services on the e-Learning application domain, because Web Service is the most complex choice for distance education during these days. The process of e-Learning can be promising more efficiently by utilizing of Web usage mining. Mor07/e sophisticated tools are developed for internet customer’s behaviour to boost sales and profit, but no such tools are developed to recognize learner’s performance in e-Learning. In this paper, some data mining techniques are examined that could be used to improve web-based learning environments. Keywords— Data Mining; Data Warehousing; e-Learning; Moodle; LMS; LCMS. I. INTRODUCTION Usually the decision-making data are stored in files and databases. The results getting by huge amount of data are not easy, for which the data mining techniques are very constructive. Data mining is the process of taking out information in terms of patterns or set of laws (e.g. association rules, sequential patterns, classification trees) from huge databases. So, it is also known as data or knowledge discovery. For example, by pulling out demographic data of students’ enrolments, the university, college or any institute could get better the qualitative explanation (e.g. information for past’s students) of database. Any association does not deal with a single database, but deals with various kind of database means multiple databases but there is the need for fast processing, and integrating of these databases which can be possible by data warehouse. Centralizing data management and revival is often distinct as data warehousing. This centralizing helps the user to maximize access to the data and analyzing it. The data warehouse supports different types of analyses, including elaborate queries on large amounts of data that may require extensive searching. When databases are set up for queries on daily transactions, they are called “operational data stores” rather than data warehouse. So, a data warehouse is a storehouse of an organization’s electronically stored data [3]. The mechanisms of data warehouse are: retrieval, extract, analysis, transform, load data and managing data dictionary. Data mining, data warehousing, and Online Analytical Processing (OLAP) together form the functionality of decision making or Decision Support System (DSS). The various areas Eof application of data mining and data warehousing are ecommerce, e governance, online shopping, digital library, online reading, e-learning or e-education, etc. Among these, these days e learning is an important application of data mining. E-Learning is sometimes known as electronic learning or e-learning in which there is no face-to-face interaction between the teacher and the students. Rather than it is webbased learning. It uses Web or Internet technology and delivers digital contents, provides learner oriented environment for teachers and students [4]. So, the environment is not teacher-centric. It may include all types of Technology Enhanced Learning (TEL), where technology is used to support the learning process [5]. For example, in companies, e-Learning is used to deliver training courses to employees and in universities, e- Learning is used for enrolment of students in different courses, provides teaching without any face-to-face interaction, or on-campus facilities, but through internet that is online. As a whole, eLearning includes Distance Learning (DL), Computer Based Teaching (CBT), Computer Aided Instruction (CAI), and Life Long Learning (LLL) principle. So, we see that, e-Learning consists of various types of databases, storing information for user access. To implement e-Learning, data mining can help to construct e-textbook, e-reading, digital libraries, etc. Further scope of e-Learning is blended e- Learning which is a combination of face-to-face interaction and online learning. It incorporates online lectures, tutorials, performance and decision support systems, simulations and games, and more [5]. II. E-LEARNING ARCHITECTURE OR DESIGN A. Functional Model The practical model of an e-Learning structure creates an interface between the mechanisms and the objects of the eLearning system. It is shown in “Fig. 1”. 144 | P a g e www.ijacsa.thesai.org (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 4, No.1, 2013 UM Store User Modelling System PC1 MOODLE SYSTEM PC2 MOODLE PC3 DATABASESE Fig. 1 Moodle Based E-Learning Architecture The structural design of e-Learning till now does not provide any apparent picture of the e-Learning components. The e-Learning structural design contains two models: the information model and the component model. These two replicas are to be joined and an interface must be defined to attain interoperability. This structural design of e-Learning gives a practical model of the components of e-Learning for the consistency of e-Learning development. The Advanced Distributed Learning (ADL)’s Sharable Content Object Reference Model (SCORM) practical model explains the swap of data within a Learning Content Management System (LCMS) or a Learning Management System (LMS) to track user’s progress. But the functionality is not explained by SCORM. A multi-user atmosphere in which the knowledge developer can create, reuse, manage, store, and distribute digital learning content from a central storehouse is known as LCMS. Here the processes adjoining the learning are managed by LMS. LCMS permits the users to generate and to use again small units of digital instructional learning material. The incorporated use of metadata arrangements and learning object import and export formats also allows learning objects to be created and shared by multiple tools and repositories. LCMS integrates specifications of metadata, content wrapping, and content communication. The components of LCMS are shown in “Fig 1”. LMS needs the interchange of customer profile and customer registration information with other systems. The position of the course choice and the learner action are offered by the LCMS. The mechanisms and information needed are shown in “fig 1”. So, there is an incorporation of LMS and LCMS. Secondly, the SCORM is developed by US Department of Defense’s ADL. This is an “application profile” consisting of a set of terms and conditions. The three main mechanisms of SCORM are: 1) Runtime Environment: The runtime environment is an API describes the interface between learning object and LMS or LCMS to track learner’s progress; 2) Meta-Data: A set of data elements to explain learning contents so that it can simply explore for identified and accessed [7]; 3) Content wrapping: Content wrapping is the release and exchange of structured content i.e. learning objects and courses between different LMS and LCMSs; As a course is separated into lessons, and sometimes the lessons are divided into topics The SCORM condition explains two hierarchical levels: 1) Content aggregation: A group of learning resources to construct complex structures, contents aggregations may be nested and may have lower-level blocks of contents which outline a content aggregation; 2) Resources: Two major types of educating resources are there: SCO and ASSET; The stage at which student interacts with the learning content and also the LMS tracks the results is known as SCO. Basically, it is a learning object. A part of content in form of movie, sound, graphic or other media item is referred as an ASSET. Most ASSETS are started by SCOs as part of their in-house content (e.g. graphics come into view on an HTML page). III. STANDARDS IN E-LEARNING Standards in e-Learning give standardized data structures and communication protocols for e-Learning objects and 145 | P a g e www.ijacsa.thesai.org (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 4, No.1, 2013 cross-system workflows [1]. The standards are of the following types: 1) Metadata: Metadata refers to the labelling of learning contents and catalogs to maintain indexing, storage, detection (searching), recovery of learning objects by several repositories of data mining and data warehousing techniques. The data utilized here is known as metadata; 2) Content Wrapping: Content Wrapping permits the transport of course content from one learning management system to another learning management system. The most significant content wrapping system these days is, ADL’s SCORM [7]. The facts of the contents are stored in various databases which can be developed and received by data mining and data warehousing techniques; 3) User Profile: User Profile consist personal data, learning history, prerequisites, learning plans, degrees and certifications, evaluation of information and contribution status in existing learning; 4) Student Registration: Student Registration identifies the availability of courses for the learner, also, information about other members of the course. 5) Content Communication: It gives an interface between student data and previous activity after content is started. The message is developed by ADL’s SCORM Object Reference Model. This architecture explains the fundamental thought of scattered e-Learning system means the communication of messages through the communication of web service agents, present in each system. Service Provider is the podium that hosts right to use to the service. It is the server in a clientservice environment. Service Requester is the function that is looking for and calling upon or initiating the communication with a service. Discovery Agency is a searchable set of service explanation where service providers issue their service descriptions. According to Xiaofei Liu, Abdulmotaleb EI Saddik and Nicolas D. Georganas [1], the discovery agency may be centralized or distributed. Information presented by XML concerning learning is wrapped with the Simple Object Access Protocol (SOAP) arrangement and is swapped between requester and provider. A Web Services Description Language (WSDL) file holding the explanation of the message and information regarding end point is published by the provider to permit requester to create the SOAP message and transmit it to the exact destination. IV. BENEFITS OF DATA MINING IN E-LEARNING There are several web usage tools to carry out data mining and data ware housing tasks. For, instance, Two data mining and data ware housing tools are WebSIFT and WebLogMiner for pattern detection from web logs [10][11] but these tools are not initiated in e-Learning environment till now because if the educator does not have sufficient knowledge in data mining, can’t use these tools to get better efficiency of eLearning. Web usage mining is a new system, devoted for e- Learning is being industrialized to permit the educators for online assess activities [9]. It facilitates the educator to follow the activities in the course web site and take out patterns and behaviours, get better or adapt the course content. For example, one could recognize the paths regularly or frequently visited, the paths never visited, etc. By analyzing these general traversal paths of the course content web pages or recurrent changes in individual traversal paths, the design of the course can be known to be better fit the requirements of students. Two types of data mining techniques are used in eLearning: off-line web usage mining and integrated web usage mining. Off-line Web Usage Mining: Off-line web usage mining is the detection of patterns with a separate application. This pattern detection process permits educators to evaluate the access behaviours, legalization of the learning modules, assessment of the learner’s activities, assessment between learners and their access pattern, etc [9]. The model of off-line web mining is a tool for the instructors to apply sequential analysis, association rules and clustering for the detection of relations between the learning actions of learners, interesting prototype of on-line actions and to group parallel access behaviour respectively. So, in off-line web usage mining, incorporated educators can place questions and authenticate the learning models, they utilize as well as the structure of the web site as it is read thoroughly by the learner. It is being observed that off-line web usage mining is a parametric move towards where the parameters are the instructors, educators, learners, etc. Integrated Web Usage Mining: Contrasting to off-line web usage mining, incorporated web usage mining is the procedure of determining patterns incorporated with e-Learning application. This covers adaptive websites, personalization of actions. Also, suggestion of actions to learners according to their favourites along with their history of actions is done by automatic recommenders in incorporated web usage mining. A recommender-based association rule mining is being expanded currently that consists of facts of finding out applicable association between learning performance and creating association rules are recommended to the learner as the suggested next step in the learning session [10]. So, incorporated web usage mining is a non-parametric approach. V. CONCLUSION AND FUTURE WORK In this paper, an obvious analysis of the content state of eLearning standard is being explained. Also, a functional model of dissimilar learning objects is presented here. The swapping of system workflows is also being explained in this paper. Elearning standard gives interoperability between learning systems and tools from several vendors. A standard means of message is set up between dissimilar software applications. This communication is likely by the Web-Services technology. The Web usage mining technique is explained in this paper, which is a non-trivial procedure of taking out helpful and previously unknown blueprints from the use of Web. The data mining techniques to improve e-education are explained in this paper. Since e-Learning process is a endlessly changeable process, the safety services, the encryption of messages, and the general facts to explain services and 146 | P a g e www.ijacsa.thesai.org (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 4, No.1, 2013 services access points in e-Learning systems environments are in call for thought. Though, several tools using data mining techniques to aid e-Learning system are being developed, the research is still in progress, since the data record given by the Web Servers are inadequate, so there is a call for more specialized logs from the application side to improve the already logged information. REFERENCES Xiaofei Liu, Abdulmotaleb El Saddik and Nicolas D.Georganas”AN IMPLEMENTABLE ARCHITECTURE OF AN E-LEARNING SYSTEMS”. CCECE 2003 – CCGEI 2003 Montreal May/mai 2003. [2] FUNDAMENTALS OF DATABASE SYSTEMS, Fourth Edition, Elmasri and Navathe. [3] Data Mining: What is Data Mining, Web site at http://www.anderson.ucla.edu/faculty/jason.frand/teacher/technologies/p alace/datamining.htm. [4] “Introduction to E-Learning “Website at http://www.chengzhi.net/english/index.htm [5] “E-Learning From Wikipedia to free encyclopedia”, Web site at http://en.wikipedia.org/wiki/E-Learning. [6] C. Romero and S. Ventura , “Data Mining in E-Learning” (WIT Press ,2006). [7] Leopold Kause, Carol Fallon “Creating E-Learning Content in Authorware 7 for SCORM1.2 Compliant LMSs and LCMSs”, Web site at http://adobe.com/resources/elearning/article/l0packager01. [8] “IMS Global Learning Consortium”, Web site at http://www.imsproject.org/ [9] Osmar R. Za¨ıane, “Web Usage Mining for a Better Web-Based Learning Environment”. [10] R. Cooley, B.Mobasher, J.Srivastva, “Web Mining: Information and Pattern Discovery on the World Wide Web”, Procedings of the ninth IEEE international conference on Tools with AI, 1997. [11] O.R. Zaiane, M.Xin, J. Han, Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs, Proceedings from the ADL’98 –Advances in Digital Libraries, Santa Barbara, 1998. [1] [12] E-Learning: A Milestone in the Research of Data Mining; by Sabyasachi Pattnaik, Jui Pattnayak, Priyaranjan Dash http://interscience.in/SpIss_ijcct_icct2010vol2_no234/25_ML AUTHORS PROFILE Dr Mohammed Fahad AlAjmi was born in Kingdom of Saudi Arabia. He received his Ph.D in Pharmacy from King Saud University, Riyadh, Saudi Arabia in 2007.He chaired many position in the university and currently working as vice dean for quality and development in Prince Sultan College for EMS affiliated to King Saud University. To date he taught many pharmacy students, more than 30 courses. Students' level varies from primary to undergraduate levels. Shakir Khan was born on 5th Feb, 1978 at Kallanheri in Saharanpur district UP, India. He is working as a Researcher in College of Electronic Learning in King Saud University, Kingdom of Saudi Arabia. He received his Master of Science in Computer Science from Jamia Hamdard (Hamdard University), New Delhi, India in the year 2005, and PhD computer Science scholar in Manav Bharti University, Solan (HP) India since 2010. He is member of IEEE. He has actively attended many international conferences and published various research papers in National and International conferences as well as journals. His current areas of interests are in Cloud Computing, Software Engineering, Data Mining and E Learning. Apart from that he worked in the field of Software Development in different MNC companies at Noida India Dr. Arun Sharma, alumni of IIT Roorkee and Thapar University, received his M.Tech. (Computer Science and Engineering) from Punjabi University, Patiala, INDIA and Ph.D. (Computer Science) Thapar University, Patiala, INDIA. Currently, he is working as head of the department of Computer Science and Engineering Department in KIET school of engineering and Technology at Ghaziabad, India. His areas of interests include Software Engineering, Soft Computing and Database Systems. He has published a number of papers in international Journals and Conferences including IEEE, ACM, Springer, WILEY and others in India and abroad. Dr. Sharma is an active member of IEEE, ACM and Computer Society of India. He is also a member of Board of Studies (BoS) of Mahamaya Technical University (MTU), Noida. He is also on the panel of subject experts and examination for various Universities like IGNOU, BBA University (Central), Lucknow, GGSIP University, Delhi, Thapar University, Patiala, and others. He is also an active member of Editorial Board and Review Committee of several Journals including Journal of Computer Science (USA), International Journal of Computer Science and Security (Malaysia), Research Journal of Information Technology, USA and others. 147 | P a g e www.ijacsa.thesai.org