“Most real estate companies don’t understand that the analytics process, or any tech process, involves a sophisticated pipeline of ideation in a machine-suitable way: data collection, cleaning, processing, storage, building learning algorithms, and then using models in production. Most of these processes sound boring and too in the weeds, but without them you have zero chance of success using data and analytics.”
One big problem: The current proptech environment is full of hype and unrealistic claims about what technology can do. Panknin said this inflates expectations among real estate companies and results in numerous failed projects.
To educate the real estate industry, the center has an initiative to develop an open-source database of real estate-related data that is ready for AI and machine learning applications. “The idea here is that most real estate companies can’t get value from data scientists because they don’t have enough data to work with,” Panknin said. “And most real estate companies aren’t going to pay for data scientists to clean data for two years before they start seeing a payoff from them. So by forming this database, real estate companies can more quickly get value from data scientists and engineers.”
Among the center’s clients are Deloitte, JPMorgan and RXR, all of which have deep financial interests in real estate development and investment. Panknin leads the projects for the companies. The AI center has been working with RXR and JPMorgan since June 2020, and with Deloitte since September 2021.
One of Panknin’s main jobs is to be what he described as the translator between the technologists and real estate professionals. “There is a gap between them that makes effective communication nonexistent,” he said. “You cannot bring a real estate person together with a technical person and solve complex problems well. You have to have translators in the middle.”
The center, for example, is working with RXR to understand how and why individual neighborhoods within a metropolitan area often appreciate in price at drastically different rates.
“The goal was to partner Columbia with our in-house team that’s working on this full time,” said Andrew Min, senior vice president of strategy and digital initiatives at RXR. “How can we find an academic partner that can bring to bear the latest and greatest that academia understands in terms of advances in data science, new algorithms and new approaches, as well as partner with students that live this full time, and help them understand the practical side of data science, statistics and machine learning?”
RXR’s thesis is that data can and will be used increasingly by the commercial real estate industry to change everything, said Min. “Specifically the areas we’re excited about are making better investment decisions, and using data to inform how to operate our buildings better and ultimately to create better customer experiences.”
Data engineering and advances in data science as an academic discipline, along with the explosion in the number of data sources, have led to similar changes in retail, hospitality, and essentially every other industry outside of real estate, he added. “We think those same forces will operate in real estate as well.”
The RXR project draws on three to five Columbia graduate data science students and focuses on both near- and long-term goals, said Min.
“The neighborhood growth model is still ongoing,” he said. “That’s one that probably will still take some time, but we try to have clear deliverables at the end of each semester. So, today, what we’ve done is we’ve analyzed a lot of different data sources — traditional economic data, but also things like building permits, 311 calls, Yelp data, transit patterns and demographics. We’ve also done a lot of work on data architecture and engineering. What’s the sensible way to connect all this data together and store it in a way that’s usable and scalable?”
It’s not all academic, of course. RXR is a successful real estate developer and owner, and wants any project results to further that success.
“We’ve also started building shorter-term products that our investments team can use,” Min said. “For example, we built a platform where they can see places of interest as described by Yelp, overlaid around our various investments. You can see bars, coffee shops, parks or other types of places of interest near our given investments, and you can better understand their character.”
In addition, RXR tries to make the students’ experience working with the company a rich one, said Yoann Poirier, lead data scientist at RXR.
“We tried to focus on what the students are going to get from the project,” said Poirier. “Even though the project is going for multiple years, we’re trying to get a reward at the end of a semester by making sure that the students get some knowledge from that. Running a project like that is not always fun. There’s a lot of data cleaning, for instance, when we use public data. This is something that can be valuable for the students, but sometimes not so much, because it can be a lot of work.
“So we want to make sure that they gain the most out of it by also working on the data engineering, as well as data science, and studying building predictive models. Even though we’re not comfortable to actually say, ‘We finished a predictive model that has been moved to a prediction,’ we’ll make sure the students actually gain some knowledge in the process.”