Edraak, an initiative of the Queen Rania Foundation, is the largest MOOC (Massive Online Open Course) in the Arabic language, providing learning opportunities to millions of users. Using Artificial Intelligence (AI) powered solutions from Dae.mn, Edraak is bringing state of the art Machine Learning (ML) to the service of their users, improving the likelihood they will find and engage with appropriate material and ultimately benefit from it.
Edraak’s aim is mainly to be of service to the Arabic speaking community and instrumental in its continued intellectual development. As such, Edraak is constantly seeking ways to connect its users with subject matter that will benefit them and also to ensure its users are successful in their learning.
After an initial discovery phase, Dae.mn investigated Edraak’s platform, data, and approach, and a set of ways in which ML & AI could contribute to Edraak’s core goals were presented. Of these, a handful were chosen to be developed into solutions ready for Edraak to use to the advantage of its users:
Machine Learning is different from other kinds of software development in that there is always more uncertainty at the start, but we know how to manage this uncertainty. Before building prototypes or moving processes into production, we decided how to measure our performance, trained some machine learning models, and ensured that the results were good enough to move on to the next stage.
This machine learning system aims to find the probability that a user will not finish a course in the future. For our dropout prediction system we examined the data available to us carefully to understand how well it reflected the historical record of user actions: can we determine when exactly the dropout occurred? Can we trust this historical data to provide us an insight into how well our system is working? We designed around the gaps in the data to create an online system that adjusts its learning and estimate of performance as newer data becomes available. The dropout prediction is designed to be run as a batch process flowing into further actions to be taken on ‘at risk’ users, such as targeted emails.
For our course recommender system we designed a machine learning model that learnt to predict courses that users are likely to be successful in - for example, the user completing a course - based on the history of the user in the MOOC. The machine learning model tries to learn the following relationship:
Taking this core system and using it in a recommender involved building around it logic to provide sufficient diversity, and balance over different courses, including new courses for which we don’t yet have enough data for the learning system.
We wrapped the trained recommender model using Google Cloud Functions with our recommender logic for ensuring diversity and balance in predictions, providing our own HTTP endpoint.
For both dropout prediction and recommender systems, we used a solution 100% within the Google Cloud ecosystem, using the Google Cloud Platform tools for orchestration, scalable and cheap extraction, transformation and loading of data, machine learning training, post processing, and dashboarding.
We built an end-to-end training pipeline, dashboard and endpoint architecture 100% within Google Cloud. The pipeline and dashboard, similar in both models, as well as the HTTP endpoint for the recommender system, are shown below:
“Working with the Dae.mn team, Edraak is able to utilise Machine Learning & AI to offer a more exciting and engaging experience for our learners.”
Sherif Halawa, Head of Engineering & Research at Edraak