转载

简单扯扯 PyCharm 4.5 中新加的 Python Profiler 功能

对python开发者来说，相信大家对Jetbrains出的python编辑器 PyCharm 应该都有了解。对我自己来说，每天项目中都在用。自从半年前从C#转到python之后直到现在依旧还在适应，适应这门语言，适应编辑器（不过也老怀恋CSharp和VS了，MS大法也很好！）。

这小半年下来发现其实写写python也还行的：

语言本身的表达性很好，非常简洁。个人很喜欢python的 装饰器 ，做一些横向的复用效果非常好。
有活跃的开源社区，各种开源框架、开源库。

几个月的python码下来，也养成了一些坏毛病，不管有意识还是无意识的经常很想把一大坨逻辑用一行代码解决，并不太关心代码的效率，再加上有 python语言的效率本来就很差 这条被大家说烂掉的观点加持，更加有恃无恐，呵呵。反过来想想，在coding的过程中多权衡，多尝试一些更好的方式，边写边重构才更能帮助你更好的coding和成长吧，而且在整个过程中有很多的乐趣，适当培养一些对代码的 洁癖 还是棒棒哒。

差点都跑的没背影了，言归正传。在写代码的过程中如果碰到需要对代码执行效率权衡的话，能够量化的做一些profile就更好了，常用的一些方式有借助一些像 profile 、 hotshot 、 timeit 来手动的做一些测量，不过这样需搞有一些缺点：

需要手动写一些测试的代码；
需要对输出的结果进行一定的排版和美化，方便阅读；
收集的信息有限

锵锵锵，终于轮到主角出场了。

PyCharm4.5中加入了一个新的功能： Python Profiler ，只需要运行 Run | Profile… 就能使用，默认支持使用 cProfile ，提供两种视图： Statistics 和 Call Graph :

简单扯扯 PyCharm 4.5 中新加的 Python Profiler 功能

Call Graph

Statistic

两种视图下面显示的内容是一样的，包含函数被调用所执行的时间，次数等，只不过在”Call Graph“视图下面，代码执行过程中各个函数被调用的层次关系都被完美的显示出来，按调用耗时纵向排列，而且还辅以警告的颜色来进行提示。在各个节点上点击右键能够看到 Navigate to Source 的操作，点击能够导航到相应的源码处，非常方便查看代码。

在这里我们以 二叉查找树

为例，展示一些比如像二叉树查找、遍历等基本操作在python profiler里面运行的效果：

import time class Node(object):  def __init__(self, data, left, right):   self.data = data   self.left = left   self.right = right  def show(self):   return self.data class BSTree(object):  def __init__(self):   self.root = None  def insert(self, data):   temp_node = Node(data, None, None)   if self.root is None:    self.root = temp_node   else:    current_node = self.root    while True:     parent_node = current_node     if data < current_node.data:      current_node = current_node.left      if current_node is None:       parent_node.left = temp_node       break     else:      current_node = current_node.right      if current_node is None:       parent_node.right = temp_node       break  def get_min(self):   current_node = self.root   while current_node.left is not None:    current_node = current_node.left   return current_node.data  def get_max(self):   current_node = self.root   while current_node.right is not None:    current_node = current_node.right   return current_node.data  def find(self, data):   current_node = self.root   while current_node is not None:    if current_node.data == data:     return current_node    elif data < current_node.data:     current_node = current_node.left    else:     current_node = current_node.right   return None

测试代码：

from BSTree import BSTree import random def build_bst_tree(num):  bst_tree = BSTree()  source = list(xrange(1, num))  while num > 0 and source:   data = random.choice(source)   source.remove(data)   bst_tree.insert(data)   num -= 1  return bst_tree def in_order(node):  node_str = ""  if node is not None:   in_order(node.left)   print node.show(),   in_order(node.right) my_bst_tree = build_bst_tree(500) in_order(my_bst_tree.root) my_bst_tree.get_min() my_bst_tree.get_max() my_bst_tree.find(123)

依次点击菜单 Run | Profile... 会生成并打开一个 xxxx.pstat 的Tab，里面就是分别按 Statistics 和 Call graph 两种视图展示的profile结果：

profile.png

在profile的结果中我们能够看到，整个文件执行共耗时13ms，二叉树遍历 in_order 和二叉树生成函数 build_bst_tree 基本占了大部分运行的时间，占用的时间越长，节点的颜色越深，还是比较体贴的小功能。

我们对寻找二叉树最大值节点的函数 get_max 稍稍修改一下 time.sleep(5)

，拖延一下执行的时间：

def get_max(self):         time.sleep(5)         current_node = self.root         while current_node.right is not None:             current_node = current_node.right         return current_node.data

现在再来看一下修改之后的profile的结果：