Group 1. Movie Portal

    As movies become an increasingly popular entertainment for people of all ages, people always concern about how to distinguish a good one in a sea of choices. As the advertisements are usually confusing, people may prefer to search for some details about movies and do some comparison.

Our movies data portal can support people to search the basic information of movies including movie name, cast, director, country, language, duration, released date and types of movie. More importantly, this portal will integrate the users’ comments (ranks) and box office trends which it’s helpful for user to distinguish whether the positive (or negative) comments come from real users or paid posters. With abundant data, the system could even classify whether a movie hire posters or not (e.g. the movie with less box office but have a large amount of comments in total or a large amount of comments submitted in a short period) and that is meaningful when people view the movie’s ranks and comments before they make decisions and go to watch movies.

Data Source

1.       Basic information of movies - douban.com

Douban.com provide API for user to request movie information and this API will return movie name, cast, director, country, language, duration ,released date in JSON format.

2.  Movie box office - imdb.com

 An online database about movie integrated details, including directors, actors, running time, rating, comments, as well as box offices and so on.

3.  Ranks and comments of movies - douban.com

We have to implement a spider to crawl the ranks and comments of movies on douban.com. The rank is a weighted rank calculated by Douban.com and comment contains user ID (user identity), review, rank, submitted date.

4.  Movie box office - maoyan.com

A movie box office statistics website which indicates movie box office, box office ratio.

 

Group Members

Luo Ziyang   20297556

He Jinfeng   20294671

Shi yuxi     20299554

 

Group 2 Climate Portal

Data portal usages:

Climate portal is a integrated data portal which used to rank the similarity between the main cities located all around the world.

Description:

You can easily find the climate of city which include attributes city longitude, city latitude, temperature, pressure, sea level, ground level, humidity, wind, rain, snow, etc. In terms of rank list for one or more cities. This portal enables people who requires similar climate but do not want to live in the current city to migrate to the best suitable place. It also helps to investigate the global climate change effect, and the science of crops or plants grow observation.

Data sources:

http://openweathermap.org

https://www.yahoo.com/news/weather/

http://www.weather-forecast.com

Group Memebers:

Yjdong@connect.ust.hk 20302492

Cdengab@connect.ust.hk 20292075

Jlicd@connect.ust.hk 20296320

 

Group3 Sport News Portal

Data portal usages: Sports Information

By mainly collecting the information from the above data sources, we want to find some useful knowledge which can help soccer clubs find the right player they need.Here is the main idea of the Portal:

1.there are three entities behind the portal:Soccer players, Soccer Clubs and Games. For players,we want to get the info such as goal,support,pass,price etc(in one or two seasons).Of course, the height, weight, belonging is included. For Clubs, the coach, player list, latest game info is a basic. For  games, something happened in one game will be important like scores, the home and away.

Data Sources:

sports.sina.com.cn

www.chashenjia.com(

soccer.hupu.com

 

Group memebers:

 

WANG Xingjian 20301539(xwangcc)

WEN Junjie 20301151(jwenaf)

WU Mian 20301424(mwuam)

 

Group 4 Movie and Actor Portal

In this project, our team will implement a data portal which is for movie and actor searches. The input is a query which is a combination of terms related to movies or actors and the output is the detailed information of the movie or actor or a sorted list of movies or actors. The original data is crawled down from web site, data mining technologies such as feature extraction, entity identification, entity disambiguation, etc will be used to extract information from the web site and to build the data base.

 

Data source:

1) https://www.wikipedia.org/

api: http://www.programmableweb.com/api/wikipedia

2) http://www.imdb.com/

api: http://www.omdbapi.com/

3) https://www.themoviedb.org/?language=zh

api: https://www.themoviedb.org/documentation/api

 

Group Member:

QI, XIAOXU 20298859 xqiab@connect.ust.hk

JIN, YUE 20295728 yjinah@connect.ust.hk

CHEN, GUANHAO 20292221 gchenah@connect.ust.hk

 

Group 5 Music Portal

Because one company can’t buy all the copyright of all the songs, users can’t use one music player to listen to all the music, and we hope to use this portal to combine the music and singers in these three website and use can check which music player they can use to listen the music they want to listen to.

Data source:

y.qq.com,

music.163.com,

music.baidu.com

Group Member:

 

XIONG Qi  20301864

CAI Xinjia  20292934

QIU Jinyuan  20298770

 

Group 6. Hotels and Flights Portal

Sometimes we may need to (or want to) go for a trip to somewhere, stay for several days, and then go back. And most of the time we may take a flight because it's fast and comfortable.

However, which flight/hotel should we choose is always a problem that may take us so much time to solve. Thus, our team want to build this data portal to recommend hotels and flights.

Choose your origin and destination, and the days of go and back, the portal will give you some recommended hotels and flights, depending on their ratings, locations and so on.

Data Source: 

http://english.ctrip.com/

http://www.tripadvisor.com/

http://www.hotels.com/

Group Members:

 JIAN Xun  20292685 xjian@ust.hk

LEI Xiayu  20297817 xlei@ust.hk

SHAO Heng 20299322 hshaoab@ust.hk

 Group 7:  Movie and Actor

Our data portal is mainly used for prediction and query which you can not get the result from google about movies and actors. For example, user can know whose movies have the best quality in recent 10 years or we can predict how many movies

would be released next year. Also, a change of audience taste for movies and actors could be shown through our data portal in terms of the comments, rankings, and the award situation. We can recommend movies if user specify the type, actors, time and

other requirements as well.

 

Data Sources:

Top Rated Movies:

http://www.imdb.com/chart/top?ref_=nv_mv_250_6

 

Movies of Different Years:

http://www.movieinsider.com/movies/-

http://www.imdb.com/year/

http://movieweb.com/movies/2015/

 

Movies, Directors, Actors Awards:

http://www.imdb.com/awardscentral/?

pf_rd_m=A2FGELUUNOQJNL&pf_rd_p=2399153622&pf_rd_

r=19Q6C2TV9E96MCJP0ESR&pf_rd_s=top-

1&pf_rd_t=15091&pf_rd_i=oscars&ref_=ac_ac_acd_nav_i1

 

Group Members:

LI, Derui dlian@connect.ust.hk 20293055

LIU, Dan dliuag@connect.ust.hk 20292972

ZHANG, Haowei hzhangbc@connect.ust.hk 20301735

 

Group 8: Books & Authors Portal

Our main data portal is about ebooks and authors. 

Data Source:

http://www.ebooks-share.net/

http://www.free-ebooks.net

https://www.overdrive.com/

https://www.goodreads.com/

Group Members:

Yun Tianjiao, 20302844, tyun@connect.ust.hk

Yu Yaoyao, 20303094, yyuay@connect.ust.hk

Zhou Rongqi, 20303824, rzhouac@connect.ust.hk

Group 9: Pop Music Portal

We are going to create a data portal of pop songs. This portal can be used to do simple analysis on the trend and popularity of the pop musics. We are planning to have 3 tables at this stage, to store the information about singers (artists), songs and the music albums. The detailed attributes of these entities will be shown in the following list.

 

Data sources :

Discogs A

music database which provides free API and supports XML.

AllMusic A

website provides information and reviews for musics.

● Ultimate Music Database A

detailed pop music database contains over 300,000 artists.

● Last.fm A

Music community website. Online music player.

 

Group Members:

CHEN, Jiahang (20292295) jchencg@connect.ust.hk

CAO, Hengrui (20294516) hcaoab@connect.ust.hk

LAM, Wai Kit (20296124) wklamag@connect.ust.hk

The project is to build a data portal for enquiry of company information and open job positions in HK job market. The data portal will consolidate the open job position and company information from the most popular job hunting websites (Jobsdb, Recruit.com, cpjobs.com). Data analysis and data visualization will be performed to provide some HK job market analysis (e.g. average salary by industry, average salary by career level, etc.) in the data portal.

Group 10. The Job Protal

Data sources:

·Jobsdb.com: http://hk.jobsdb.com/hk

·Recruit.com: http://recruit.com.hk/

·cpjobs.com:http://www.cpjobs.com/hk/?j=1

 

Group Members:

Ren Chun (crenab@connect.ust.hk)

Yang Bao (byangah@connect.ust.hk)

Group 11. The NBA Players Data Portal

Abstract:

Data sources:

nba.sports.sina.com.cn

g.hupu.com/nba/stats/players

www.juhe.cn

Usage:

Retrieve NBA serving players personal information

Retrieve NBA serving players last two seasons data statistics

Find the relation between players age and performance

Group member

 

LI TIANYUAN  20303003  tliap@connect.ust.hk

LI AO  20296576  aliaj@connect.ust.hk

HE XIANG  20294592  xheam@connect.ust.hk

 

Group 12. Movie Portal

Data sources

http://www.imdb.com/

http://www.rottentomatoes.com/

http://www.cinemablend.com/

 

Group member

Team Members

 

Hung Chi Hung  20329581

TONG Ka Wai  20300341

Terence Yuen  20300224

 

Group 13 Crowd Investment Portal

Internet crowdfunding is a recent phenomenon allowing people to fund different kind of projects. The process is similar to that of a business angel but the backers are usually middle class citizens. This practice is usually performed via internet. Many websites exist nowadays to promote different projects to be crowdfunded : Kickstarter, Ulule, Gofundme and so on. Kickstarter for example helped to launch over 100, 000 projects with over 2 million dollars pledged.

 

Data Sources

The three data sources for our project will be Kickstarter, Indiegogo and Pozible.

Group members

 

LABBE, Kevin Patrick Joseph

20304828

kpjlabbe@ust.hk

MARTYNAVA, Karina

20300547

kmartynava@ust.hk

THOMPSON, Julien Edward

20305119

jethompsonaa@ust.hk

 

Group 14: Researcher Portal

The project is to provide the famous experts’ information by searching the keywords in computer science on the website. The information includes the author, email, department and the publication list according to the integrated assessment of DBLP, C-DBLP and Bib Sonomy.

Data Sources:

DBLP: DBLP is a computer science bibliography website hosted at Universität Trier, in Germany.

http://dblp.uni-trier.de/

C-DBLP: C-DBLP integrates authoritative computer journals and conference papers in China to provide a good browsing for researchers of literature data query service.

http://c-dblp.cn/

Bib Sonomy: BibSonomy is a social bookmarking and publication-sharing system. It offers users the ability to store and organize their bookmarks and publication entries and supports the integration of different communities and people by offering a social platform for literature exchange.

http://www.bibsonomy.org/

 

Group members:

WANG Jingwei                  jwangcn@connect.ust.hk

JIANG Yu             yjiangav@connect.ust.hk

Group 15: Property Portal

Hotel Portal is a data portal that displays a recommended hotel to stay in for a certain location and time. This data portal will acquire hotel data such as description, price, room availability, hotel reviews, and many more from various different API (Application Programming Interface).

This data portal will require the user(s) to input their location of stay, duration of stay, and how many people will be staying. The data portal will then browse through all the data entities and display the recommended hotels and other important information for the user(s) to stay in.

 

DATA SOURCES

 

The API that this data portal will use is:

-              SkyScanner Business API (http://business.skyscanner.net/portal/en-GB/Documentation) (XML)

SkyScanner Business API provides travel search products such as flight rates, car hires, and hotel information. The API usage requires the user(s) to register for the API key, which will be granted access to the live pricing of hotels, car hire, and flights. Display requirements also require the data portal owner to display a SkyScanner logo.

 

-              Expedia Affiliate Network API (http://developer.ean.com/) (XML)

Expedia Affiliate Network (EAN) is one of the world’s largest travel companies. EAN works with partners in 33 countries to get hotel booking informations. EAN provides a near-complete lists of travel information such as hotel lists, room availability, hotel fee, and many more. API users will have to sign up to EAN to retrieve an API key and a secret key in order to retrieve data from their database.

 

-              TripAdvisor API (https://developer-tripadvisor.com/content-api/) (JSON)

TripAdvisor provides up-to-date hotel listings, ratings, reviews, and many more information. The API is a location-based information and upon using this API on a data portal, it is required for the data portal owner to include the TripAdvisor logo and many more display requirements on the website.

 

TEAM MEMBERS

Kristanto Sean Njotoprawiro, 20296289, ksn@connect.ust.hk

Ricky Dwiputra SETIAMANAH, 10563787,  dsricky@connect.ust.hk