Programming Languages by Age
9.2. Programming Languages by Age#
What are the most popular languages for a specific age group? We can find that out using data extracted from Diffbot. Here is the code used to extract the data from Diffbot.
I uploaded the dataset used in this tutorial to this repository so that you can try out the dataset yourself.
!pip install observable_jupyter
from observable_jupyter import embed
import pandas as pd
df = pd.read_csv(
"https://media.githubusercontent.com/media/khuyentran1401/dataset/master/programming/languages_by_age.csv",
index_col=0,
)
df.head(10)
born_year | language | count | |
---|---|---|---|
0 | 2001-01-01 | Abap | 12 |
1 | 2001-01-01 | Ada | 2 |
2 | 2001-01-01 | C | 0 |
3 | 2001-01-01 | C# | 644 |
4 | 2001-01-01 | Cobol | 9 |
5 | 2001-01-01 | Dart | 29 |
6 | 2001-01-01 | Go | 44 |
7 | 2001-01-01 | Groovy | 6 |
8 | 2001-01-01 | Haskell | 10 |
9 | 2001-01-01 | Java | 1946 |
We will visualize the dataset using the bar chart race created on Observable. The year on the bottom left represents the birth year. Each bar represents how popular each language is for a particular birth year.
Languages in the race: Python, Java, JavaScript, C#, C++, PHP, R, Objective-C, TypeScript, Swift, Kotlin, Matlab, Go, Rust, VBA, Ruby, Scala, Ada, Visual Basic, Dart, Lua, Cobol, Groovy, Abap, Perl, Julia, Haskell, Pascal.
embed("@khuyentran1401/programming-languages-by-age", cells=["viewof replay", "chart"])