Summary: | 1117 Russian cities with city name, region, geographic coordinates and 2020 population estimate. How to use from pathlib import Path import requests import pandas as pd url = ("https://raw.githubusercontent.com/" "epogrebnyak/ru-cities/main/assets/towns.csv") # save file locally p = Path("towns.csv") if not p.exists(): content = requests.get(url).text p.write_text(content, encoding="utf-8") # read as dataframe df = pd.read_csv("towns.csv") print(df.sample(5)) Files: towns.csv - city information regions.csv - list of Russian Federation regions alt_city_names.json - alternative city names Сolumns (towns.csv): Basic info: <code>city</code> - city name (several cities have alternative names marked in <code>alt_city_names.json</code>) <code>population</code> - city population, thousand people, Rosstat estimate as of 1.1.2020 <code>lat,lon</code> - city geographic coordinates Region: <code>region_name</code> - subnational region (oblast, republic, krai or AO) <code>region_iso_code</code> - ISO 3166 code , eg <code>RU-VLD</code> <code>federal_district</code>, eg <code>Центральный</code> City codes: <code>okato</code> <code>oktmo</code> <code>fias_id</code> <code>kladr_id</code> Data sources City list and city population collected from Rosstat publication Регионы России. Основные социально-экономические показатели городов and parsed from publication Microsoft Word files. City list corresponds to this Wikipedia article . Alternative dataset is wiki-based Dadata city dataset (no population data). Comments City groups <code>Ханты-Мансийский</code> and <code>Ямало-Ненецкий</code> autonomous regions excluded to avoid duplication as parts of <code>Тюменская область</code>. Several notable towns are classified as administrative part of larger cities (<code>Сестрорецк</code> is a municpality at Saint-Petersburg, ...
|