Prediction systems and methods are provided. The system obtains a first social media data pertaining to a first set of users, filters the first social media data to obtain a filtered social media data, generates a word embedding matrix including co-occurrence words each represented as a vector having a context, aggregates vectors pertaining each social data to obtain a first set of vectors, and trains machine learning technique(s) (MLTs) using the first set of vectors and context of the first set of vectors. The system further obtains a second social media data pertaining to a second set of users, and performs filtering, word embedding matrix generation, and aggregation operations to obtain a second set of vectors, and further applies the trained MLTs on the second set of vectors and context associated with the second set of vectors to predict age and gender of the second set of users.
展开▼