Blog of Sara Jakša

How to Get MBTI Tagged Texts from Reddit

I thought that I might want to return to the MBTI clasification that I was working on last semester. It seems like a good project work to do in this semester as well.

So for that I have decided that this time I might also want to get the data from subreddits that deal with MBTI types.

You can find the jupyter notebook on my github.

I am not yet satisfied, as I realized that people can sometimes write things that I as a human would be able to classify, but I did not catch with this script. Also, for some reason the people on the ESFJ subforum don't really use it. Which gave me an imbalance with ESFJ having the least amount of text.

I am still putting it on the internet in case it helps somebody.