Group 1. Movie Portal
As movies become an increasingly popular form of entertainment for people of all ages, viewers often struggle to pick out a good one from a sea of choices. Because advertisements are usually confusing, people may prefer to look up details about movies and compare them.
Our movie data portal lets people search basic information about movies, including movie name, cast, director, country, language, duration, release date and genre. More importantly, the portal will integrate users’ comments (ratings) and box office trends, which helps users judge whether positive (or negative) comments come from real users or paid posters. With abundant data, the system could even classify whether a movie has hired paid posters (e.g. a movie with low box office but a large number of comments in total, or a large number of comments submitted in a short period). That is meaningful when people view a movie’s ratings and comments before deciding whether to watch it.
1. Basic information of movies - douban.com
Douban.com provides an API for requesting movie information. The API returns movie name, cast, director, country, language, duration and release date in JSON format.
2. Movie box office - imdb.com
An online movie database with integrated details, including directors, actors, running time, ratings, comments, box office figures and so on.
3. Ranks and comments of movies - douban.com
We have to implement a spider to crawl the ratings and comments of movies on douban.com. The rating is a weighted score calculated by Douban.com, and each comment contains a user ID (user identity), review, rating and submission date.
4. Movie box office - maoyan.com
A movie box office statistics website that reports each movie's box office and box office ratio.
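The paid-poster check described above could be prototyped with two simple rules. This is only a minimal sketch: the function name, the thresholds and the unit of box office are hypothetical and would need tuning on real data from the sources listed.

```python
from collections import Counter
from datetime import date

# Hypothetical thresholds, not taken from the proposal; tune on real data.
MAX_COMMENTS_PER_BOX_OFFICE_UNIT = 50.0  # comments per unit of box office
MAX_SHARE_ON_BUSIEST_DAY = 0.5           # fraction of all comments on one day


def looks_like_paid_posting(box_office, comment_dates):
    """Return True if either heuristic from the proposal fires.

    box_office    -- box office in any fixed unit (assumed: millions)
    comment_dates -- one datetime.date per crawled comment
    """
    total = len(comment_dates)
    if total == 0:
        return False
    # Rule 1: low box office but a large number of comments in total.
    ratio = total / max(box_office, 1e-9)
    too_many_comments = ratio > MAX_COMMENTS_PER_BOX_OFFICE_UNIT
    # Rule 2: a large share of the comments submitted in a short period (one day).
    busiest_day_count = Counter(comment_dates).most_common(1)[0][1]
    bursty = busiest_day_count / total > MAX_SHARE_ON_BUSIEST_DAY
    return too_many_comments or bursty
```

A real classifier would likely replace the fixed thresholds with a model trained on labeled movies, but the two rules make the intended signals concrete.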
Group Members
Shi yuxi 20299554
Group 2 Climate Portal
Data portal usages:
The Climate Portal is an integrated data portal used to rank major cities around the world by the similarity of their climates.
Description:
You can easily look up the climate of a city, including attributes such as longitude, latitude, temperature, pressure, sea level, ground level, humidity, wind, rain and snow, presented as a ranked list for one or more cities. The portal helps people who want a similar climate but do not want to stay in their current city find the most suitable place to move to. It also supports studying the effects of global climate change and observing how crops or plants grow.
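One possible way to produce such a similarity ranking is to normalize each attribute and sort cities by Euclidean distance to the chosen city. The sketch below assumes that approach; the function name and the data layout (a dict of attribute dicts) are illustrative, not part of the proposal.

```python
import math


def rank_by_similarity(target, cities):
    """Rank all other cities from most to least similar to `target`.

    cities -- dict mapping city name to a dict of numeric attributes;
              every city is assumed to carry the same attribute keys.
    """
    names = list(cities)
    attrs = list(cities[target])
    # Min-max normalize so that no attribute dominates because of its unit.
    lo = {a: min(cities[n][a] for n in names) for a in attrs}
    hi = {a: max(cities[n][a] for n in names) for a in attrs}

    def norm(name, attr):
        span = hi[attr] - lo[attr]
        return 0.0 if span == 0 else (cities[name][attr] - lo[attr]) / span

    def distance(name):
        return math.sqrt(
            sum((norm(name, a) - norm(target, a)) ** 2 for a in attrs)
        )

    return sorted((n for n in names if n != target), key=distance)


# Made-up values for illustration only.
sample = {
    "CityA": {"temperature": 20, "humidity": 60},
    "CityB": {"temperature": 21, "humidity": 62},
    "CityC": {"temperature": 5, "humidity": 30},
}
print(rank_by_similarity("CityA", sample))  # ['CityB', 'CityC']
```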
Data sources:
https://www.yahoo.com/news/weather/
http://www.weather-forecast.com
Group Members:
Yjdong@connect.ust.hk
20302492
Cdengab@connect.ust.hk
20292075
Jlicd@connect.ust.hk
20296320
Group 3 Sport News Portal
Data portal usages: Sports Information
By mainly collecting information from the data sources listed below, we want to find useful knowledge that can help soccer clubs find the players they need. Here is the main idea of the portal:
1. There are three entities behind the portal: soccer players, soccer clubs and games. For players, we want information such as goals, assists, passes, price, etc. (over one or two seasons), along with height, weight and club. For clubs, the coach, player list and latest game information are the basics. For games, what happened during the game matters, such as the score and which side was home or away.
Data Sources:
sports.sina.com.cn
www.chashenjia.com
soccer.hupu.com
Group members:
WANG Xingjian 20301539(xwangcc)
WEN Junjie 20301151(jwenaf)
WU Mian 20301424(mwuam)
Group 4 Movie and Actor Portal
In this project, our team will implement a data portal for movie and actor searches. The input is a query that combines terms related to movies or actors, and the output is either the detailed information of a movie or actor or a sorted list of movies or actors. The original data is crawled from web sites; data mining techniques such as feature extraction, entity identification and entity disambiguation will be used to extract information from the sites and build the database.
Data source:
1) https://www.wikipedia.org/
api: http://www.programmableweb.com/api/wikipedia
2) http://www.imdb.com/
api: http://www.omdbapi.com/
3) https://www.themoviedb.org/?language=zh
api: https://www.themoviedb.org/documentation/api
Group Member:
QI, XIAOXU 20298859 xqiab@connect.ust.hk
JIN, YUE 20295728 yjinah@connect.ust.hk
CHEN, GUANHAO 20292221 gchenah@connect.ust.hk
Group 5 Music Portal
Because no single company can buy the copyright to every song, users cannot listen to all the music they want with one music player. We hope to use this portal to combine the songs and singers from the three websites below, so that users can check which music player offers the music they want to listen to.
Data source:
y.qq.com,
music.163.com,
music.baidu.com
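Combining the three catalogs amounts to building an index from each song to the platforms that carry it. The following is a minimal sketch under the assumption that each site's catalog has already been crawled into (title, artist) pairs; the function name and the matching rule (case-insensitive exact match) are hypothetical simplifications.

```python
from collections import defaultdict


def build_availability(catalogs):
    """Map (title, artist) to the sorted list of platforms offering the song.

    catalogs -- dict mapping a platform name to a list of (title, artist) pairs
    """
    index = defaultdict(set)
    for platform, songs in catalogs.items():
        for title, artist in songs:
            # Naive normalization; real data would need fuzzier matching.
            key = (title.strip().lower(), artist.strip().lower())
            index[key].add(platform)
    return {key: sorted(platforms) for key, platforms in index.items()}


# Made-up catalog entries for illustration only.
catalogs = {
    "y.qq.com": [("Song A", "Singer X")],
    "music.163.com": [("song a ", "singer x"), ("Song B", "Singer Y")],
}
availability = build_availability(catalogs)
print(availability[("song a", "singer x")])  # ['music.163.com', 'y.qq.com']
```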
Group Member:
XIONG Qi 20301864
CAI Xinjia 20292934
QIU Jinyuan 20298770
Group 6. Hotels and Flights Portal
Sometimes we may need (or want) to take a trip somewhere, stay for several days, and then come back. Most of the time we take a flight because it's fast and comfortable.
However, choosing a flight and a hotel is always a problem that can take a lot of time to solve. Thus, our team wants to build this data portal to recommend hotels and flights.
Choose your origin and destination and your departure and return dates, and the portal will recommend hotels and flights based on their ratings, locations and so on.
Data Source:
Group Members:
JIAN Xun 20292685 xjian@ust.hk
LEI Xiayu 20297817 xlei@ust.hk
SHAO Heng 20299322 hshaoab@ust.hk
Group 7:
Movie and Actor
Our data portal is mainly used for predictions and queries about movies and actors whose results you cannot get from Google. For example, users can find out whose movies have had the best quality over the last 10 years, or we can predict how many movies will be released next year. Changes in audience taste for movies and actors can also be shown through the portal in terms of comments, rankings and awards. We can also recommend movies if the user specifies the genre, actors, time and other requirements.
Data Sources:
Top Rated Movies:
http://www.imdb.com/chart/top?ref_=nv_mv_250_6
Movies of Different Years:
http://www.movieinsider.com/movies/-
http://www.imdb.com/year/
http://movieweb.com/movies/2015/
Movies, Directors, Actors Awards:
http://www.imdb.com/awardscentral/?pf_rd_m=A2FGELUUNOQJNL&pf_rd_p=2399153622&pf_rd_r=19Q6C2TV9E96MCJP0ESR&pf_rd_s=top-1&pf_rd_t=15091&pf_rd_i=oscars&ref_=ac_ac_acd_nav_i1
Group Members:
LI, Derui dlian@connect.ust.hk 20293055
LIU, Dan dliuag@connect.ust.hk 20292972
ZHANG, Haowei hzhangbc@connect.ust.hk 20301735
Group 8: Books & Authors Portal
Our main data portal is about ebooks and authors.
Data Source:
Group Members:
Yun Tianjiao, 20302844, tyun@connect.ust.hk
Yu Yaoyao, 20303094, yyuay@connect.ust.hk
Zhou Rongqi, 20303824, rzhouac@connect.ust.hk
Group 9: Pop Music Portal
We are going to create a data portal of pop songs. This portal can be used to do simple analysis of the trends and popularity of pop music. We are planning to have 3 tables at this stage, storing information about singers (artists), songs and music albums. The detailed attributes of these entities will be shown in the following list.
Data sources :
● Discogs: A music database which provides a free API and supports XML.
● AllMusic: A website that provides information and reviews for music.
● Ultimate Music Database: A detailed pop music database containing over 300,000 artists.
● Last.fm: A music community website and online music player.
Group Members:
CHEN, Jiahang (20292295) jchencg@connect.ust.hk
CAO, Hengrui (20294516) hcaoab@connect.ust.hk
LAM, Wai Kit (20296124) wklamag@connect.ust.hk
Group 10. The Job Portal
The project is to build a data portal for enquiring about company information and open job positions in the HK job market. The data portal will consolidate open job positions and company information from the most popular job hunting websites (Jobsdb, Recruit.com, cpjobs.com). Data analysis and data visualization will be performed to provide some HK job market analysis (e.g. average salary by industry, average salary by career level, etc.) in the data portal.
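The market analysis mentioned in this proposal (average salary by industry or by career level) is a group-by-and-average step. A minimal sketch follows, assuming each consolidated posting is a dict with hypothetical keys "industry", "career_level" and "salary"; none of these names come from the job sites themselves.

```python
from collections import defaultdict
from statistics import mean


def average_salary_by(postings, field):
    """Average the salary of postings grouped by `field`.

    Postings without a salary are skipped rather than counted as zero.
    """
    groups = defaultdict(list)
    for posting in postings:
        salary = posting.get("salary")
        if salary is not None:
            groups[posting[field]].append(salary)
    return {key: mean(values) for key, values in groups.items()}


# Made-up postings for illustration only.
postings = [
    {"industry": "IT", "career_level": "Entry", "salary": 20000},
    {"industry": "IT", "career_level": "Senior", "salary": 30000},
    {"industry": "Finance", "career_level": "Entry", "salary": 25000},
    {"industry": "Finance", "career_level": "Senior", "salary": None},
]
print(average_salary_by(postings, "industry"))
```

The same function covers the career-level breakdown by passing "career_level" as the field.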
Data sources:
·Jobsdb.com: http://hk.jobsdb.com/hk
·Recruit.com: http://recruit.com.hk/
·cpjobs.com: http://www.cpjobs.com/hk/?j=1
Group Members:
Ren Chun (crenab@connect.ust.hk)
Yang Bao (byangah@connect.ust.hk)
Group 11. The NBA Players Data Portal
Abstract:
Data sources:
nba.sports.sina.com.cn
g.hupu.com/nba/stats/players
www.juhe.cn
Usage:
Retrieve active NBA players' personal information
Retrieve active NBA players' statistics for the last two seasons
Find the relation between players' age and performance
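One simple way to quantify the relation between age and performance is the Pearson correlation coefficient. The sketch below assumes performance has been reduced to a single number per player (for example points per game, which is our assumption, not a choice stated in the proposal).

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (spread_x * spread_y)
```

A value near -1 would suggest performance falls as age rises, a value near +1 the opposite, and a value near 0 no linear relation. Since performance often peaks mid-career, a non-linear fit may describe real data better.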
Group member
LI TIANYUAN 20303003 tliap@connect.ust.hk
LI AO 20296576 aliaj@connect.ust.hk
HE XIANG 20294592 xheam@connect.ust.hk
Group 12. Movie Portal
Data sources
http://www.imdb.com/
http://www.rottentomatoes.com/
Group member
Hung Chi Hung 20329581
TONG Ka Wai 20300341
Terence Yuen 20300224
Group 13 Crowd Investment Portal
Internet crowdfunding is a recent phenomenon that allows people to fund different kinds of projects. The process is similar to that of a business angel, but the backers are usually middle-class citizens. This practice is usually carried out via the internet. Many websites exist nowadays to promote projects to be crowdfunded: Kickstarter, Ulule, Gofundme and so on. Kickstarter, for example, has helped launch over 100,000 projects with over 2 billion dollars pledged.
Data Sources
The three data sources for our project will be Kickstarter, Indiegogo and Pozible.
Group members
LABBE, Kevin Patrick Joseph 20304828 kpjlabbe@ust.hk
MARTYNAVA, Karina 20300547 kmartynava@ust.hk
THOMPSON, Julien Edward 20305119 jethompsonaa@ust.hk
Group 14: Researcher Portal
The project provides information about well-known experts in computer science, found by searching keywords on the website. The information includes the author, email, department and publication list, based on an integrated assessment of DBLP, C-DBLP and BibSonomy.
Data Sources:
DBLP: DBLP is a computer science bibliography website hosted at Universität Trier, in Germany.
http://dblp.uni-trier.de/
C-DBLP: C-DBLP integrates authoritative computer science journal and conference papers in China to give researchers a convenient literature browsing and query service.
http://c-dblp.cn/
BibSonomy: BibSonomy is a social bookmarking and publication-sharing system. It lets users store and organize their bookmarks and publication entries, and supports the integration of different communities and people by offering a social platform for literature exchange.
Group members:
WANG Jingwei jwangcn@connect.ust.hk
JIANG Yu yjiangav@connect.ust.hk
Group 15: Property Portal
Hotel Portal is a data portal that recommends hotels to stay in for a given location and time. The portal will acquire hotel data such as descriptions, prices, room availability, hotel reviews and more from several APIs (Application Programming Interfaces).
The portal will require users to enter their location of stay, duration of stay and the number of people staying. It will then search all the data entities and display the recommended hotels together with other important information for the stay.
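The search-and-recommend step could look roughly like the sketch below. It assumes hotel records from the different APIs have already been merged into dicts with hypothetical keys ("location", "max_guests", "rooms_available", "rating", "price_per_night"); the ranking rule (best rating first, then lowest price) is also an assumption, since the proposal does not fix one.

```python
def recommend_hotels(hotels, location, nights, guests, top_n=3):
    """Return up to `top_n` bookable hotels, best rating first, then cheapest."""
    matches = [
        h for h in hotels
        if h["location"] == location
        and h["max_guests"] >= guests
        and h["rooms_available"] > 0
    ]
    ranked = sorted(matches, key=lambda h: (-h["rating"], h["price_per_night"]))
    return [
        {
            "name": h["name"],
            "rating": h["rating"],
            "total_price": h["price_per_night"] * nights,
        }
        for h in ranked[:top_n]
    ]


# Made-up hotel records for illustration only.
hotels = [
    {"name": "H1", "location": "X", "max_guests": 2, "rooms_available": 1,
     "rating": 4.5, "price_per_night": 100},
    {"name": "H2", "location": "X", "max_guests": 4, "rooms_available": 0,
     "rating": 4.9, "price_per_night": 90},
    {"name": "H3", "location": "X", "max_guests": 2, "rooms_available": 3,
     "rating": 4.5, "price_per_night": 80},
    {"name": "H4", "location": "Y", "max_guests": 2, "rooms_available": 5,
     "rating": 5.0, "price_per_night": 60},
]
print([h["name"] for h in recommend_hotels(hotels, "X", nights=2, guests=2)])
# ['H3', 'H1']
```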
DATA SOURCES
The API that this data portal will use is:
- SkyScanner Business API (http://business.skyscanner.net/portal/en-GB/Documentation) (XML)
SkyScanner Business API provides travel search products such as flight rates, car hire and hotel information. Using the API requires registering for an API key, which grants access to live pricing of hotels, car hire and flights. The display requirements also oblige the data portal owner to display a SkyScanner logo.
- Expedia Affiliate Network API (http://developer.ean.com/) (XML)
Expedia Affiliate Network (EAN) is one of the world’s largest travel companies. EAN works with partners in 33 countries to obtain hotel booking information. EAN provides near-complete travel information such as hotel lists, room availability, hotel fees and more. API users have to sign up with EAN to obtain an API key and a secret key in order to retrieve data from its database.
- TripAdvisor API (https://developer-tripadvisor.com/content-api/) (JSON)
TripAdvisor provides up-to-date hotel listings, ratings, reviews and more. The API serves location-based information, and a data portal that uses it is required to include the TripAdvisor logo and meet further display requirements on the website.
TEAM MEMBERS
Kristanto Sean Njotoprawiro, 20296289, ksn@connect.ust.hk
Ricky Dwiputra SETIAMANAH, 10563787, dsricky@connect.ust.hk