  1. CDH (READ-ONLY)
  2. DISTRO-512

HiveServer2 leaking Zookeeper connections

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: CDH4.3.0
    • Fix Version/s: None
    • Component/s: Hive
    • Labels:
    • Environment:

      Description

      We at Swiftype are trying to use HiveServer2 from CDH 4.3.0 in production, and we've noticed that every time we execute a query on it, Hive leaks one ZooKeeper connection (I see it staying open forever in lsof). I can easily reproduce it with a simple Nagios check that connects, executes a "show tables" query, and disconnects.
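One way to watch for the leak described above is to count the established connections from the HiveServer2 process to ZooKeeper before and after each check run. The following is a minimal sketch, not part of the original report: it assumes ZooKeeper listens on its default client port 2181, that `lsof` is run with `-nP` so addresses and ports stay numeric, and the helper names `count_zk_connections` and `lsof_for_pid` are hypothetical.

```python
import subprocess

ZK_PORT = 2181  # assumption: ZooKeeper's default client port; adjust for your ensemble


def count_zk_connections(lsof_output: str, zk_port: int = ZK_PORT) -> int:
    """Count ESTABLISHED TCP connections whose remote end is zk_port."""
    count = 0
    for line in lsof_output.splitlines():
        # Skip the header, listening sockets, and anything without a remote end.
        if "(ESTABLISHED)" not in line or "->" not in line:
            continue
        # The NAME column looks like "local_ip:port->remote_ip:port".
        remote = line.split("->", 1)[1].split()[0]
        if remote.rsplit(":", 1)[-1] == str(zk_port):
            count += 1
    return count


def lsof_for_pid(pid: int) -> str:
    """Return lsof's TCP socket listing for one process (hypothetical helper).

    -a ANDs the filters, -p selects the process, -iTCP limits to TCP sockets,
    -nP disables host and port name lookups.
    """
    result = subprocess.run(
        ["lsof", "-nP", "-a", "-p", str(pid), "-iTCP"],
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout
```

Calling `count_zk_connections(lsof_for_pid(pid))` with the HiveServer2 process ID before and after a query would show whether the count grows by one per query, as the report describes.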

      Running "SET hive.support.concurrency=false" before executing a query prevents the ZK connections from leaking, but I'm not sure how safe that is, because the Cloudera docs (http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.3.0/CDH4-Installation-Guide/cdh4ig_topic_18_5.html) say "Failure to do this will prevent HiveServer2 from handling concurrent query requests and may result in data corruption". So we use hive.support.concurrency=false in our Nagios check, but the production workload still runs with concurrency enabled, and this brings our Hive server down a few times a day because the ZooKeeper servers start rejecting connections from the Hive box.
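The check-only workaround above can be sketched as a short statement sequence run on a single session. This is an illustration under assumptions, not the reporter's actual check: `run_nagios_check` is a hypothetical name, and `cursor` stands for any Python DB-API 2.0 style cursor already connected to HiveServer2. Per the Cloudera warning quoted above, the setting is applied only to the monitoring session, not to production queries.

```python
# Statements issued by the monitoring check, in order.
CHECK_STATEMENTS = [
    # Workaround from the report: disable concurrency support for this session only.
    "SET hive.support.concurrency=false",
    # The trivial query the check uses to confirm the server responds.
    "SHOW TABLES",
]


def run_nagios_check(cursor):
    """Run the check statements on an open cursor and return the last result set.

    `cursor` is assumed to expose DB-API 2.0 style execute() and fetchall().
    """
    for statement in CHECK_STATEMENTS:
        cursor.execute(statement)
    return cursor.fetchall()
```

Keeping the `SET` statement in the same session as the query matters, since the setting is issued per session and a new connection would start from the server's configured default.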


            People

            • Assignee: Unassigned
            • Reporter: Oleksiy Kovyrin (kovyrin)
            • Votes: 0
            • Watchers: 4
