在哪里下载电影数据集?

Posted

技术标签:

【中文标题】在哪里下载电影数据集?【英文标题】:Where to download the movie dataset? 【发布时间】:2012-08-30 21:34:52 【问题描述】:

我想在单个文件中下载包含电影名称和演员列表等基本信息的电影转储。我寻找了几个选项,例如 http://api.themoviedb.org/2.1/ 和 http://api.themoviedb.org/2.1/ 。 TheMovieDB 不提供批量下载数据的选项。 IMDB 有数据,但似乎分散在文件中。此外,我无法弄清楚如何将演员、电影名称等单独文件中的数据拼接起来,因为它们似乎没有任何通用键。如果我在这里遗漏了什么,请告诉我。

有人可以告诉我如何下载电影数据集吗?

【问题讨论】:

抱歉,这个网站是关于编程的问题,而不是“我在哪里可以找到 xyz”。 @Anony-Mousse :我需要这个大型电影数据集,因为我有一些应用程序我想在其中应用数据挖掘原则。我什至已经正确地标记了我的帖子。所以,我想你会想重新考虑我关于编程的问题的有效性。 好吧,一旦你有一个实际的数据挖掘问题,这个问题就更合适了。 @Anony-Mousse :如果上述评论对您来说听起来有点粗鲁,我深表歉意。但关键是,在我有一个数据集可以使用之前,我无法实现我的想法。只是为了给这个讨论一点编程的感觉,我确实尝试过网络抓取***页面来提取电影信息。但是您可以向此类网站发出的请求数量是有限制的。此外,这些数据并不完全详尽,需要大量时间才能生成。因此这个问题!我希望你能得到我提出这个问题的动机以及它与我的项目的相关性。 尝试下载***,而不是网页抓取。甚至还有一个称为 DBPedia 的 RDF 版本,它是 Wikipedia 的 解析 版本。尝试使用这些。 【参考方案1】:

您可以使用 Freebase 以 JSON 格式下载 movies 和 actors。请参阅API wiki 了解更多信息。

例如查询:

GET https://www.googleapis.com/freebase/v1/mqlread?query=[%22type%22:%22/film/actor%22,%22id%22:null,%22name%22:null]

将返回:


  "result": [
    "type": "/film/actor",
    "id": "/en/milla_jovovich",
    "name": "Milla Jovovich"
  , 
    "type": "/film/actor",
    "id": "/en/angus_macfadyen",
    "name": "Angus Macfadyen"
  , 
    "type": "/film/actor",
    "id": "/en/aisha_tyler",
    "name": "Aisha Tyler"
  , 
    "type": "/film/actor",
    "id": "/en/stephen_dorff",
    "name": "Stephen Dorff"
  , 
    "type": "/film/actor",
    "id": "/en/vincent_laresca",
    "name": "Vincent Laresca"
  , 
    "type": "/film/actor",
    "id": "/en/dawn_greenhalgh",
    "name": "Dawn Greenhalgh"
  , 
    "type": "/film/actor",
    "id": "/en/nola_augustson",
    "name": "Nola Augustson"
  , 
    "type": "/film/actor",
    "id": "/en/dudley_moore",
    "name": "Dudley Moore"
  , 
    "type": "/film/actor",
    "id": "/en/julie_andrews",
    "name": "Julie Andrews"
  , 
    "type": "/film/actor",
    "id": "/en/bo_derek",
    "name": "Bo Derek"
  , 
    "type": "/film/actor",
    "id": "/en/robert_webber",
    "name": "Robert Webber"
  , 
    "type": "/film/actor",
    "id": "/en/dee_wallace-stone",
    "name": "Dee Wallace-Stone"
  , 
    "type": "/film/actor",
    "id": "/en/ryan_phillippe",
    "name": "Ryan Phillippe"
  , 
    "type": "/film/actor",
    "id": "/en/salma_hayek",
    "name": "Salma Hayek"
  , 
    "type": "/film/actor",
    "id": "/en/neve_campbell",
    "name": "Neve Campbell"
  , 
    "type": "/film/actor",
    "id": "/en/mike_myers",
    "name": "Mike Myers"
  , 
    "type": "/film/actor",
    "id": "/en/satoshi_tsumabuki",
    "name": "Satoshi Tsumabuki"
  , 
    "type": "/film/actor",
    "id": "/en/masanobu_ando",
    "name": "Masanobu Ando"
  , 
    "type": "/film/actor",
    "id": "/en/david_gahan",
    "name": "Dave Gahan"
  , 
    "type": "/film/actor",
    "id": "/en/martin_gore",
    "name": "Martin Gore"
  , 
    "type": "/film/actor",
    "id": "/en/andrew_fletcher_1961",
    "name": "Andrew Fletcher"
  , 
    "type": "/film/actor",
    "id": "/en/alan_wilder",
    "name": "Alan Wilder"
  , 
    "type": "/film/actor",
    "id": "/en/gerard_butler",
    "name": "Gerard Butler"
  , 
    "type": "/film/actor",
    "id": "/en/lena_headey",
    "name": "Lena Headey"
  , 
    "type": "/film/actor",
    "id": "/en/david_wenham",
    "name": "David Wenham"
  , 
    "type": "/film/actor",
    "id": "/en/robert_de_niro",
    "name": "Robert De Niro"
  , 
    "type": "/film/actor",
    "id": "/en/gerard_depardieu",
    "name": "G\u00e9rard Depardieu"
  , 
    "type": "/film/actor",
    "id": "/en/dominique_sanda",
    "name": "Dominique Sanda"
  , 
    "type": "/film/actor",
    "id": "/en/john_belushi",
    "name": "John Belushi"
  , 
    "type": "/film/actor",
    "id": "/en/ned_beatty",
    "name": "Ned Beatty"
  , 
    "type": "/film/actor",
    "id": "/en/dan_aykroyd",
    "name": "Dan Aykroyd"
  , 
    "type": "/film/actor",
    "id": "/en/lorraine_gary",
    "name": "Lorraine Gary"
  , 
    "type": "/film/actor",
    "id": "/en/murray_hamilton",
    "name": "Murray Hamilton"
  , 
    "type": "/film/actor",
    "id": "/en/robert_downey_jr",
    "name": "Robert Downey Jr."
  , 
    "type": "/film/actor",
    "id": "/en/kiefer_sutherland",
    "name": "Kiefer Sutherland"
  , 
    "type": "/film/actor",
    "id": "/en/winona_ryder",
    "name": "Winona Ryder"
  , 
    "type": "/film/actor",
    "id": "/en/john_hurt",
    "name": "John Hurt"
  , 
    "type": "/film/actor",
    "id": "/en/richard_burton",
    "name": "Richard Burton"
  , 
    "type": "/film/actor",
    "id": "/en/suzanna_hamilton",
    "name": "Suzanna Hamilton"
  , 
    "type": "/film/actor",
    "id": "/en/cyril_cusack",
    "name": "Cyril Cusack"
  , 
    "type": "/film/actor",
    "id": "/en/gregor_fisher",
    "name": "Gregor Fisher"
  , 
    "type": "/film/actor",
    "id": "/en/tony_leung_chiu_wai",
    "name": "Tony Leung Chiu Wai"
  , 
    "type": "/film/actor",
    "id": "/en/gong_li",
    "name": "Gong Li"
  , 
    "type": "/film/actor",
    "id": "/en/faye_wong",
    "name": "Faye Wong"
  , 
    "type": "/film/actor",
    "id": "/en/takuya_kimura",
    "name": "Takuya Kimura"
  , 
    "type": "/film/actor",
    "id": "/en/zhang_ziyi",
    "name": "Zhang Ziyi"
  , 
    "type": "/film/actor",
    "id": "/en/carina_lau",
    "name": "Carina Lau"
  , 
    "type": "/film/actor",
    "id": "/en/chang_chen",
    "name": "Chang Chen"
  , 
    "type": "/film/actor",
    "id": "/en/bird_mcintyre",
    "name": "Bird McIntyre"
  , 
    "type": "/film/actor",
    "id": "/en/maggie_cheung",
    "name": "Maggie Cheung"
  , 
    "type": "/film/actor",
    "id": "/en/chevy_chase",
    "name": "Chevy Chase"
  , 
    "type": "/film/actor",
    "id": "/en/steve_martin",
    "name": "Steve Martin"
  , 
    "type": "/film/actor",
    "id": "/en/martin_short",
    "name": "Martin Short"
  , 
    "type": "/film/actor",
    "id": "/en/joe_mantegna",
    "name": "Joe Mantegna"
  , 
    "type": "/film/actor",
    "id": "/en/jon_lovitz",
    "name": "Jon Lovitz"
  , 
    "type": "/film/actor",
    "id": "/en/alfonso_arau",
    "name": "Alfonso Arau"
  , 
    "type": "/film/actor",
    "id": "/en/tony_plana",
    "name": "Tony Plana"
  , 
    "type": "/film/actor",
    "id": "/en/al_pacino",
    "name": "Al Pacino"
  , 
    "type": "/film/actor",
    "id": "/en/carmen_maura",
    "name": "Carmen Maura"
  , 
    "type": "/film/actor",
    "id": "/en/luis_hostalot",
    "name": "Luis Hostalot"
  , 
    "type": "/film/actor",
    "id": "/en/veronica_forque",
    "name": "Veronica Forqu\u00e9"
  , 
    "type": "/film/actor",
    "id": "/en/hume_cronyn",
    "name": "Hume Cronyn"
  , 
    "type": "/film/actor",
    "id": "/en/jessica_tandy",
    "name": "Jessica Tandy"
  , 
    "type": "/film/actor",
    "id": "/en/frank_mcrae",
    "name": "Frank McRae"
  , 
    "type": "/film/actor",
    "id": "/en/elizabeth_pena",
    "name": "Elizabeth Pe\u00f1a"
  , 
    "type": "/film/actor",
    "id": "/en/dennis_boutsikaris",
    "name": "Dennis Boutsikaris"
  , 
    "type": "/film/actor",
    "id": "/en/hal_warren",
    "name": "Hal Warren"
  , 
    "type": "/film/actor",
    "id": "/en/tom_neyman",
    "name": "Tom Neyman"
  , 
    "type": "/film/actor",
    "id": "/en/john_reynolds_1941",
    "name": "John Reynolds"
  , 
    "type": "/film/actor",
    "id": "/en/rajnikanth",
    "name": "Rajnikanth"
  , 
    "type": "/film/actor",
    "id": "/en/sridevi_kapoor",
    "name": "Sridevi Kapoor"
  , 
    "type": "/film/actor",
    "id": "/en/kantimathi",
    "name": "Kantimathi"
  , 
    "type": "/film/actor",
    "id": "/en/konkona_sen_sharma",
    "name": "Konkona Sen Sharma"
  , 
    "type": "/film/actor",
    "id": "/en/shabana_azmi",
    "name": "Shabana Azmi"
  , 
    "type": "/film/actor",
    "id": "/en/soumitra_chatterjee",
    "name": "Soumitra Chatterjee"
  , 
    "type": "/film/actor",
    "id": "/en/waheeda_rehman",
    "name": "Waheeda Rehman"
  , 
    "type": "/film/actor",
    "id": "/en/rahul_bose",
    "name": "Rahul Bose"
  , 
    "type": "/film/actor",
    "id": "/en/william_hopper",
    "name": "William Hopper"
  , 
    "type": "/film/actor",
    "id": "/en/joan_taylor",
    "name": "Joan Taylor"
  , 
    "type": "/film/actor",
    "id": "/en/frank_puglia",
    "name": "Frank Puglia"
  , 
    "type": "/film/actor",
    "id": "/en/james_garner",
    "name": "James Garner"
  , 
    "type": "/film/actor",
    "id": "/en/rod_taylor_1930",
    "name": "Rod Taylor"
  , 
    "type": "/film/actor",
    "id": "/en/eva_marie_saint",
    "name": "Eva Marie Saint"
  , 
    "type": "/film/actor",
    "id": "/en/paul_walker",
    "name": "Paul Walker"
  , 
    "type": "/film/actor",
    "id": "/en/eva_mendes",
    "name": "Eva Mendes"
  , 
    "type": "/film/actor",
    "id": "/en/devon_aoki",
    "name": "Devon Aoki"
  , 
    "type": "/film/actor",
    "id": "/en/john_payne_1912",
    "name": "John Payne"
  , 
    "type": "/film/actor",
    "id": "/en/evelyn_keyes",
    "name": "Evelyn Keyes"
  , 
    "type": "/film/actor",
    "id": "/en/brad_dexter",
    "name": "Brad Dexter"
  , 
    "type": "/film/actor",
    "id": "/en/frank_faylen",
    "name": "Frank Faylen"
  , 
    "type": "/film/actor",
    "id": "/en/peggie_castle",
    "name": "Peggie Castle"
  , 
    "type": "/film/actor",
    "id": "/en/jean-hugues_anglade",
    "name": "Jean-Hugues Anglade"
  , 
    "type": "/film/actor",
    "id": "/en/beatrice_dalle",
    "name": "B\u00e9atrice Dalle"
  , 
    "type": "/film/actor",
    "id": "/en/vincent_lindon",
    "name": "Vincent Lindon"
  , 
    "type": "/film/actor",
    "id": "/en/dominique_pinon",
    "name": "Dominique Pinon"
  , 
    "type": "/film/actor",
    "id": "/en/joaquin_phoenix",
    "name": "Joaquin Phoenix"
  , 
    "type": "/film/actor",
    "id": "/en/james_gandolfini",
    "name": "James Gandolfini"
  , 
    "type": "/film/actor",
    "id": "/en/catherine_keener",
    "name": "Catherine Keener"
  , 
    "type": "/film/actor",
    "id": "/en/norman_reedus",
    "name": "Norman Reedus"
  , 
    "type": "/film/actor",
    "id": "/en/dean_martin",
    "name": "Dean Martin"
  ]

类似,你会这样做:

https://www.googleapis.com/freebase/v1/mqlread?query=[%22type%22:%22/film/film%22,%22id%22:null,%22name%22:null]

获取电影标题。

【讨论】:

感谢 BioGeek 提供的信息 :) 但我仍然不明白如何使用它来连接电影和演员?对此有什么想法吗?我可以查询演员,也可以查询电影,但我要查找的信息是电影及其对应的演员。

以上是关于在哪里下载电影数据集?的主要内容,如果未能解决你的问题,请参考以下文章

KITTI数据集百度网盘

scikit-learn数据集下载太慢的问题

手写英文字符数据集..在哪里获得(并且公开可用)[关闭]

示例星型模式数据集

UCI数据集怎么用?

如何在 C# 中将数据集加载到 libsvm 中