close
Scraping vs Indexing of IDX data - WAV Group
WAV Group - Real Estate Technology Consulting and Research
About Us

Subscribe

rss logoLike what you are reading here? Then let us email you with all our new content.



Most Recent Articles

Main Menu


Most Popular Articles
Connect with Us

Victor Lund

Facebook

 


Marilyn Wilson

Facebook

 

 


Mike Audet

Facebook

 

 

 

 

Blog Comments 6 comments »

Scraping vs Indexing of IDX data

MIBOR - the Metropolitan Indianapolis Board of REALTORS has recently responded to a complaint regarding a violation of their IDX rules and regulations - specifically the “anti-scraping” regulations.

MIBOR has correctly responded to the complaint ruling that an IDX property search page which allows data to be copied and re-purposed is in violation of their rules.  I begin this post with this acknowledgement to clarify that MIBOR is in no way behaving badly.  Indeed, they are enforcing their published rules and regulations correctly.

This issue has created a very difficult situation for MIBOR, NAR, and the unfortunate member of MIBOR who is in violation of the rules.  Clearly the member had no intention of violating the rules - it is a case of confusion between the term “scraping” and “indexing” of an agent’s real estate listing website.

Here are some common definitions of the terms:

scraping is a technique in which a computer program extracts data from the display output of another program.

An index is any data structure which improves the performance of look-up.  Indexing is the task of creating that data structure using spiders.

Spiders are programs that automatically fetches data to feed search engine results.  

In this case, Google has indexed listing information on a real estate agent’s website in a way that easily allows a consumer to access that information with a click-through.  At issue here is the notion that Google has scraped components of the listing information and displayed that in the search result which links to the agent’s IDX compliant listing detail page.  As such, Google has both “indexed” and “scraped” the data.

I applaud NAR for stepping up to tackle this issue in a way that serves the best interests of the member, broker, Association of REALTORS, and the MLS.  Modifying this rule to allow for indexing and limiting scraping will take some highly technical lawyering.  Indeed, the differences between scraping and indexing is a matter of use case and intent rather than technique.  The legal bill for sorting this out is likely to be a big one :-)

What I believe is more at issue here is the notion that third party websites like Trulia and Zillow do not need to adhere to these rules - giving them a huge advantage at winning the eyes of consumers through search engine optimization techniques that agents and brokers are forbidden from pursuing.


Scraping vs Indexing of IDX data

http://waves.wavgroup.com/scraping-vs-indexing-of-idx-data
Posted on May 19, 2009 17:38:19 by Blog Author Victor.Lund
Comment from: Richard Silver| Torontoism.com [Visitor] Email · http://www.Torontoism.com
This could be a very contentious issue and does deserve proper attention and clarification. It is quickly becoming a "them and us" conflict. Good on you for taking a stand.
PermalinkPermalink May 19, 2009 19:57:27
Comment from: Rich Bailey [Visitor] Email · http://www.wolfnet.com
You granted the agents legitimacy with respect to their frustration regarding the spirit of the law, while acknowledging MIBOR's legitimate enforcement of the letter of the law. Obviously this matter deserves attention, and the result will impact all segments of the industry. Thanks, Victor, for offering a level-headed perspective on this.
PermalinkPermalink May 20, 2009 09:16:24
Comment from: Victor.Lund [Member] Email · http://www.wavgroup.com
Victor.Lund

Thanks Rich,
Wolfnet's approach to anti-scraping is rather cool. As I understand it, your servers monitor the "behavior" of web page viewing to "recognize" machine-like behavior.



John Mosey mentioned to me that you contacted them with evidence when your systems snared a "scraper" who was stealing data. Cool stuff.

PermalinkPermalink May 20, 2009 09:23:17
Comment from: Mike Bentson [Visitor] Email · http://www.spokanerealtor.com
Victor, I have seen many posts and articles on this topic and appreciate your views. It is highly complex. One point, which I believe many people are overlooking, is that many of the listings scraped/indexed do not belong to the displaying member. Advertising another broker's listing without their written consent is the objection. IDX is intended for member display. Some MLSes participate in listing syndication, we do, which helps populate these non-member sites to benefit the listing broker. But the question is, should a listing for another broker displayed on your compliant IDX website be a means of advertising your services through the search engines? Or am I missing something?
PermalinkPermalink May 21, 2009 13:56:39
Comment from: Victor.Lund [Member] Email · http://www.wavgroup.com
Victor.Lund

Great point.



I cannot refute your perspective Mike - it was never the intent of IDX to allow the advertising of another agent or broker's listing beyond display on the IDX compliant website.


At issue here is the notion of website marketing. In marketing of a website, being friendly to search engines is a cornerstone of online website marketing. If an agent or broker wants to market their website as a resource to gain access to the most current listing inventory available, then displaying samples of content on search engines that validate that claim to the consumer would require indexing.


This is a really sticky issue that can be construed in many different ways. It is an interesting debate.

PermalinkPermalink May 21, 2009 17:49:49
Comment from: Brian Lason [Visitor] Email · http://www.mlstesseract.com
Victor: I think the current MIBOR/NAR view may be justified. I've begun a series of posts at MLSTesseract to discuss this issue in more detail.
-Brian
PermalinkPermalink June 17, 2009 10:32:42
Comment on this article
  Line breaks become <br />


  Remember me


  Allow users to contact you through a message form.
Captcha image.

Please enter the characters from the image above. (case insensitive)

This post has no comments awaiting moderation.

 
 
West Coast
Victor Lund
291 Falcon Crest Drive
Arroyo Grande, CA 93420
Office: (805) 473-9119
Mobile: (805) 748-9118
E-Mail Victor/Marilyn
West Coast
Marilyn Wilson
291 Falcon Crest Drive
Arroyo Grande, CA 93420
Office: (805) 473-9119
Mobile: (805) 748-9118
E-Mail Victor/Marilyn
East Coast
Mike Audet
94 Harper Rd
Snyder, NY 14226
Office: (716) 839-4628
Mobile: (716 )984-9009
Fax: (703)-935-8768
E-Mail Mike