skip to main content
research-article

Modeling of Geographic Dependencies for Real Estate Ranking

Published: 27 August 2016 Publication History

Abstract

It is traditionally a challenge for home buyers to understand, compare, and contrast the investment value of real estate. Although a number of appraisal methods have been developed to value real properties, the performances of these methods have been limited by traditional data sources for real estate appraisal. With the development of new ways of collecting estate-related mobile data, there is a potential to leverage geographic dependencies of real estate for enhancing real estate appraisal. Indeed, the geographic dependencies of the investment value of an estate can be from the characteristics of its own neighborhood (individual), the values of its nearby estates (peer), and the prosperity of the affiliated latent business area (zone). To this end, in this paper, we propose a geographic method, named ClusRanking, for real estate appraisal by leveraging the mutual enforcement of ranking and clustering power. ClusRanking is able to exploit geographic individual, peer, and zone dependencies in a probabilistic ranking model. Specifically, we first extract the geographic utility of estates from geography data, estimate the neighborhood popularity of estates by mining taxicab trajectory data, and model the influence of latent business areas. Also, we fuse these three influential factors and predict real estate investment value. Moreover, we simultaneously consider individual, peer and zone dependencies, and derive an estate-specific ranking likelihood as the objective function. Furthermore, we propose an improved method named CR-ClusRanking by incorporating checkin information as a regularization term which reduces the performance volatility of real estate ranking system. Finally, we conduct a comprehensive evaluation with the real estate-related data of Beijing, and the experimental results demonstrate the effectiveness of our proposed methods.

References

[1]
M. Bailey, R. Muth, and H. Nourse. 1963. A regression method for real estate price index construction. J. Am. Stat. Assoc. 58 (1963), 933--942.
[2]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. J. Mach. Learn. Res. 3 (2003), 993--1022.
[3]
C. J. C. Burges, R. Ragno, and Q. V. Le. 2007. Learning to rank with non-smooth cost functions. In Advances in Neural Information Processing Systems, Volume 19. MIT Press, Cambridge, MA. http://research.microsoft.com/apps/pubs/default.aspx?id=68133.
[4]
Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li. 2007. Learning to rank: from pairwise approach to listwise approach. In Proceedings of International Conference on Machine Learning (ICML’07).
[5]
Chen Cheng, Haiqin Yang, Irwin King, and Michael R. Lyu. 2012. Fused matrix factorization with geographical and social influence in location-based social networks. In Proceedings of Conference on Artificial Intelligence (AAAI’12).
[6]
Mary-Lou Downie and Gill Robson. 2008. Automated valuation models: An international perspective. In RICS Automated Valuation Models Conference: AVMs Today and Tomorrow. 4 November 2008, London.
[7]
Bruce J. Feibel. 2003. Investment Performance Measurement, Vol. 116. John Wiley & Sons.
[8]
Gersende Fort and Eric Moulines. 2003. Convergence of the Monte Carlo expectation maximization for curved exponential families. Ann. Stat. 31 (2003), 1220--1259.
[9]
Yoav Freund, Raj Iyer, Robert E. Schapire, and Yoram Singer. 2003. An efficient boosting algorithm for combining preferences. J. Mach. Learn. Res. 4 (2003), 933--969.
[10]
Jerome H. Friedman. 2001. Greedy function approximation: a gradient boosting machine. Ann. Stat. (2001), 1189--1232.
[11]
Yanjie Fu, Bin Liu, Yong Ge, Zijun Yao, and Hui Xiong. 2014a. User preference learning with multiple information fusion for restaurant recommendation. In Proceedings of SIAM International Conference on Data Mining (SDM’14).
[12]
Yanjie Fu, Hui Xiong, Yong Ge, Zijun Yao, Yu Zheng, and Zhi-Hua Zhou. 2014b. Exploiting geographic dependencies for real estate appraisal: a mutual perspective of clustering and ranking. In Proceedings of Conference on Knowledge Discovery and Data Mining (KDD’14).
[13]
Marta C. Gonzalez, Cesar A. Hidalgo, and Albert-Laszlo Barabasi. 2008. Understanding individual human mobility patterns. Nature 453, 7196 (2008), 779--782.
[14]
Dmytro Karamshuk, Anastasios Noulas, Salvatore Scellato, Vincenzo Nicosia, and Cecilia Mascolo. 2013. Geo-spotting: Mining online location-based services for optimal retail store placement. In Proceedings of Conference on Knowledge Discovery and Data Mining (KDD’13). 9.
[15]
Vilius Kontrimas and Antanas Verikas. 2011. The mass appraisal of the real estate by computational intelligence. Appl. Soft Comput. 11 (2011), 443--448.
[16]
John Krainer and Chishen Wei. 2004. House prices and fundamental value. FRBSF Econ. Lett. (2004).
[17]
E. T.-K. Lam. 1996. Modern regression models and neural networks for residential property valuation. In Proceedings of the RICS Research -- The Cutting Edge Conference (1996).
[18]
Ming Li, Hang Li, and Zhi-Hua Zhou. 2009. Semi-supervised document retrieval. Inf. Process. Manag. (2009).
[19]
Bin Liu, Yanjie Fu, Zijun Yao, and Hui Xiong. 2013. Learning geographical preferences for point-of-interest recommendation. In Proceedings of Conference on Knowledge Discovery and Data Mining (KDD’13).
[20]
Donald Metzler and W. Bruce Croft. 2007. Linear feature-based models for information retrieval. Inf. Retr. 10 (2007), 257--274.
[21]
R. Kelley Pace. 1998. Appraisal using generalized additive models. J. Real Estate Res. 15 (1998), 77--100. http://ideas.repec.org/a/jre/issued/v15n11998p77-100.html.
[22]
Chaitra H. Nagaraja, Lawrence D. Brown, and Linda H. Zhao. 2011. An autoregressive approach to house price modeling. The Annals of Applied Statistics (2011), 124--149.
[23]
Ian Ruthven and Mounia Lalmas. 2002. Using Dempster-Shafer’s theory of evidence to combine aspects of information use. J. Intell. Inf. Syst. 19, 3 (2002), 267--301.
[24]
Yue Shi, Martha Larson, and Alan Hanjalic. 2012. Unifying rating-oriented and ranking-oriented collaborative filtering for improved recommendation. Inf. Sci. (2012).
[25]
Robert J. Shiller. 1991. Arithmetic Repeat Sales Price Estimators. Technical Report. Cowles Foundation for Research in Economics, Yale University. 971.
[26]
Chaoming Song, Zehui Qu, Nicholas Blumm, and Albert-László Barabási. 2010. Limits of predictability in human mobility. Science 327, 5968 (2010), 1018--1021.
[27]
Yizhou Sun, Jiawei Han, Peixiang Zhao, Zhijun Yin, Hong Cheng, and Tianyi Wu. 2009a. Rankclus: Integrating clustering with ranking for heterogeneous information network analysis. In Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology. ACM, 565--576.
[28]
Yizhou Sun, Yintao Yu, and Jiawei Han. 2009b. Ranking-based clustering of heterogeneous information networks with star network schema. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 797--806.
[29]
Laura O. Taylor. 2003. The hedonic method. In A Primer on Nonmarket Valuation. Springer.
[30]
Andrew Trotman. 2005. Learning to rank. Inf. Retr. 8 (2005), 359--381.
[31]
Melanie M. Wall. 2004. A close look at the spatial structure implied by the CAR and SAR models. J. Stat. Plan. Inference 121, 2 (2004), 311--324.
[32]
John Burr Williams. 1938. The Theory of Investment Value. Vol. 36. JSTOR.
[33]
Jing Yuan, Yu Zheng, and Xing Xie. 2012. Discovering regions of different functions in a city using human mobility and POIs. In Proceedings of Conference on Knowledge Discovery and Data Mining (KDD’12), Vol. 9.
[34]
Yu Zheng. 2015. Methodologies for cross-domain data fusion: an overview. IEEE Trans. Big Data 1, 1 (2015), 16--34.
[35]
Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban computing: Concepts, methodologies, and applications. ACM Trans. Intell. Syst. Technol. 5, 3 (2014), 2157--6904.
[36]
Yu Zhou and Donald R. Haurin. 2010. On the determinants of house value volatility. J. Real Estate Res. 32, 4 (2010), 377--395.
[37]
Zhi-Hua Zhou. 2012. Ensemble Methods: Foundations and Algorithms. CRC Press.
[38]
Zhi-Hua Zhou, Ke-Jia Chen, and Hong-Bin Dai. 2006. Enhancing relevance feedback in image retrieval using unlabeled data. ACM Trans. Inf. Syst. 24, 2 (2006), 219--244.

Cited By

View all

Index Terms

  1. Modeling of Geographic Dependencies for Real Estate Ranking

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Knowledge Discovery from Data
      ACM Transactions on Knowledge Discovery from Data  Volume 11, Issue 1
      February 2017
      288 pages
      ISSN:1556-4681
      EISSN:1556-472X
      DOI:10.1145/2974720
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 27 August 2016
      Accepted: 01 May 2016
      Revised: 01 March 2016
      Received: 01 May 2015
      Published in TKDD Volume 11, Issue 1

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Real estate
      2. clustering
      3. geographic dependencies
      4. ranking

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Funding Sources

      • National Science Foun- dation of China
      • 111 Project

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)18
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 10 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media