近日,在测试某个项目时,无意中发现在客户机的机器上可以直接运行一条Python命令来执行服务器端的Python脚本,故而,深入测试一下便有了下文。
很多时候,因为业务的需要我们常常需要使用Python –c exec方法在客户机上来执行远程服务器上的Python脚本或者命令。
那么,在这种情况下,因为在命令是运行在客户机上,这就必然导致了远程服务器上的Python脚本会以一定的形式运行在客户机的内存中,如果我们可以获取并还原出这些代码,这也在一定程度上造成了服务器源码的泄露。
为了验证这种泄露风险,下面是我依据一个真实案例而创建了一个简单的演示Demo:
PyOrign.py 文件:
#!python #!/usr/bin/env python import random import base64 class Test: x='' y='' def __init__(self, a, b): self.x = a self.y = b print "Initiation..., I'm from module Test" def add(self): print 'a =',self.x print 'b =',self.y c = self.x+self.y print 'sum =', c if __name__ == '__main__': print "/n[+] I'm the second .py script!" a = Test(1,2) a.add()
test.py文件:
#!python #!/usr/bin/env python import imp ifimp.get_magic() != '/x03/xf3/r/n': print "Please update to Python 2.7.10 (http://www.python.org/download/)" exit() importurllib, marshal, zlib, time, re, sys print "[+] Hello, I'm the first .py script!" _S = "http" _B = "10.66.110.151" execmarshal.loads(urllib.urlopen('%s://%s/mystatic/pyCode' % (_S, _B)).read())
接下来我们开始演示效果,首先在客户端执行以下命令:
#!bash python -c "exec(__import__('urllib2').urlopen('http://10.66.110.151/test/').read())"
运行后的结果显示如下:
简单分析一下这个过程,我们不难发现上面的命令在被执行后实际上发生的过程是这样的:
首先利用urllib2的urlopen方法来读取远程服务器上的命令代码
然后判断客户机上的python的版本是不是2.7.10,如果是,则执行下面的代码继续获取远程服务器上的可执行代码: exec marshal.loads(urllib.urlopen('http://10.66.110.151/mystatic/pyCode').read())
接着,又利用urllib的urlopen方法读取远程服务器上的可执行代码:
最后exec方法在客户机上执行marshal.loads方法反序列化后的code object对象
细心的朋友可能已经发现,在步骤3我们并没有像步骤1那样获取到exec执行的源码而是一个codeobject对象。那么我们不禁要思考一下,有没有办法将这个code object对象还原成真正的Python源码呢?如果可以,是不是也就意味着服务器上的源码存在这很大的泄露风险呢?
我们知道exec语句用来执行储存在字符串或文件中的Python语句,这既可以Python语句也可以是经过compile编译后的exec模式的code object对象。那么此处,不禁要思考获取到的code object是不是就是服务器上的Python脚本经过compile编译后的exec模式的code object对象呢?如果是的,那么只要我们能够构造出这个原始脚本编译后的pyc文件,也就意味着我们可以通过pyc文件来进一步还原出脚本的原始py文件。
接下来我们就来看看如何利用已知的codeobject对象来构造一个编译后的pyc文件。
首先,我们来分析一下pyc文件的构成。一个完整的pyc文件是由以下几部分组成:
那么,我们是否已经具备这几部分。首先是四字节的魔数Magic int, 返回到上面分析过程中的步骤1,我们看到了下面一段代码:
#!python import imp ifimp.get_magic() != '/x03/xf3/r/n': print "Please update to Python 2.7.10 (http://www.python.org/download/)" exit()</code>
此处代码就是通过Magic int来判断客户主机上的Python版本信息,那么不用说这里的Magic int也就是 imp.get_magic()
获取到的值。
接下来是四字节的pyc的时间戳,经过我的测试发现此处的时间戳可以是任意符合格式的四字节int。
最后是序列化了的PyCodeObject对象,那么这个我们也有吗?没错,我们在步骤3中读取到的codeobject对象就是这个PyCodeObject对象。
既然构造pyc所具有的三个组成部分我们都有了,我们就来尝试构造一下这个pyc文件吧。按照猜测,远程服务器应该是通过compile方法来编译原始的脚本文件,那么我们就利用同样的方法来构造它。
这里我们利用了库文件py_compile的compile方法,其具体代码实现如下:
#!python """Routine to "compile" a .py file to a .pyc (or .pyo) file. This module has intimate knowledge of the format of .pyc files. """ import __builtin__ import imp import marshal importos import sys importtraceback MAGIC = imp.get_magic() __all__ = ["compile", "main", "PyCompileError"] classPyCompileError(Exception): """Exception raised when an error occurs while attempting to compile the file. To raise this exception, use raisePyCompileError(exc_type,exc_value,file[,msg]) where exc_type: exception type to be used in error message type name can be accesses as class variable 'exc_type_name' exc_value: exception value to be used in error message can be accesses as class variable 'exc_value' file: name of file being compiled to be used in error message can be accesses as class variable 'file' msg: string message to be written as error message If no value is given, a default exception message will be given, consistent with 'standard' py_compile output. message (or default) can be accesses as class variable 'msg' """ def __init__(self, exc_type, exc_value, file, msg=''): exc_type_name = exc_type.__name__ ifexc_type is SyntaxError: tbtext = ''.join(traceback.format_exception_only(exc_type, exc_value)) errmsg = tbtext.replace('File "<string>"', 'File "%s"' % file) else: errmsg = "Sorry: %s: %s" % (exc_type_name,exc_value) Exception.__init__(self,msg or errmsg,exc_type_name,exc_value,file) self.exc_type_name = exc_type_name self.exc_value = exc_value self.file = file self.msg = msg or errmsg def __str__(self): return self.msg defwr_long(f, x): """Internal; write a 32-bit int to a file in little-endian order.""" f.write(chr( x & 0xff)) f.write(chr((x >> 8) & 0xff)) f.write(chr((x >> 16) & 0xff)) f.write(chr((x >> 24) & 0xff)) def compile(file, cfile=None, dfile=None, doraise=False): """Byte-compile one Python source file to Python bytecode. Arguments: file: source filename cfile: target filename; defaults to source with 'c' or 'o' appended ('c' normally, 'o' in optimizing mode, giving .pyc or .pyo) dfile: purported filename; defaults to source (this is the filename that will show up in error messages) doraise: flag indicating whether or not an exception should be raised when a compile error is found. If an exception occurs and this flag is set to False, a string indicating the nature of the exception will be printed, and the function will return to the caller. If an exception occurs and this flag is set to True, a PyCompileError exception will be raised. Note that it isn't necessary to byte-compile Python modules for execution efficiency -- Python itself byte-compiles a module when it is loaded, and if it can, writes out the bytecode to the corresponding .pyc (or .pyo) file. However, if a Python installation is shared between users, it is a good idea to byte-compile all modules upon installation, since other users may not be able to write in the source directories, and thus they won't be able to write the .pyc/.pyo file, and then they would be byte-compiling every module each time it is loaded. This can slow down program start-up considerably. See compileall.py for a script/module that uses this module to byte-compile all installed files (or all files in selected directories). """ with open(file, 'U') as f: try: timestamp = long(os.fstat(f.fileno()).st_mtime) exceptAttributeError: timestamp = long(os.stat(file).st_mtime) codestring = f.read() try: codeobject = __builtin__.compile(codestring, dfile or file,'exec') exceptException,err: py_exc = PyCompileError(err.__class__, err, dfile or file) ifdoraise: raisepy_exc else: sys.stderr.write(py_exc.msg + '/n') return ifcfile is None: cfile = file + (__debug__ and 'c' or 'o') with open(cfile, 'wb') as fc: fc.write('/0/0/0/0') wr_long(fc, timestamp) marshal.dump(codeobject, fc) fc.flush() fc.seek(0, 0) fc.write(MAGIC) def main(args=None): """Compile several source files. The files named in 'args' (or on the command line, if 'args' is not specified) are compiled and the resulting bytecode is cached in the normal manner. This function does not search a directory structure to locate source files; it only compiles files named explicitly. If '-' is the only parameter in args, the list of files is taken from standard input. """ ifargs is None: args = sys.argv[1:] rv = 0 ifargs == ['-']: while True: filename = sys.stdin.readline() if not filename: break filename = filename.rstrip('/n') try: compile(filename, doraise=True) exceptPyCompileError as error: rv = 1 sys.stderr.write("%s/n" % error.msg) exceptIOError as error: rv = 1 sys.stderr.write("%s/n" % error) else: for filename in args: try: compile(filename, doraise=True) exceptPyCompileError as error: # return value to indicate at least one failure rv = 1 sys.stderr.write(error.msg) returnrv if __name__ == "__main__": sys.exit(main())
在上面的代码中,我们可以看出,compile方法首先利用 imp.get_magic()
生成Magic int:
#!python MAGIC = imp.get_magic()
然后根据py文件的创建时间来生成时间戳:
#!python timestamp = long(os.fstat(f.fileno()).st_mtime
最后利用 __builtin__.compile
方法生成exec模式的code object对象,并使用 marshal.dump
方法将codeobject写入到pyc文件中
#!python codeobject = __builtin__.compile(codestring, dfile or file,'exec')
知道了原理,接下来我们可以利用下面的脚本来构造pyc文件:
#!python """Routine to "compile" a .py file to a .pyc (or .pyo) file. This module has intimate knowledge of the format of .pyc files. """ import __builtin__ import imp import marshal importos import sys importtraceback importzlib importurllib MAGIC = imp.get_magic() #根据Python版本信息生成的魔数 __all__ = ["compile", "main", "PyCompileError"] classPyCompileError(Exception): """Exception raised when an error occurs while attempting to compile the file. To raise this exception, use raisePyCompileError(exc_type,exc_value,file[,msg]) where exc_type: exception type to be used in error message type name can be accesses as class variable 'exc_type_name' exc_value: exception value to be used in error message can be accesses as class variable 'exc_value' file: name of file being compiled to be used in error message can be accesses as class variable 'file' msg: string message to be written as error message If no value is given, a default exception message will be given, consistent with 'standard' py_compile output. message (or default) can be accesses as class variable 'msg' """ def __init__(self, exc_type, exc_value, file, msg=''): exc_type_name = exc_type.__name__ ifexc_type is SyntaxError: tbtext = ''.join(traceback.format_exception_only(exc_type, exc_value)) errmsg = tbtext.replace('File "<string>"', 'File "%s"' % file) else: errmsg = "Sorry: %s: %s" % (exc_type_name,exc_value) Exception.__init__(self,msg or errmsg,exc_type_name,exc_value,file) self.exc_type_name = exc_type_name self.exc_value = exc_value self.file = file self.msg = msg or errmsg def __str__(self): return self.msg defwr_long(f, x): """Internal; write a 32-bit int to a file in little-endian order.""" f.write(chr( x & 0xff)) f.write(chr((x >> 8) & 0xff)) f.write(chr((x >> 16) & 0xff)) f.write(chr((x >> 24) & 0xff)) def compile(file, cfile=None, dfile=None, doraise=False): """Byte-compile one Python source file to Python bytecode. Arguments: file: source filename cfile: target filename; defaults to source with 'c' or 'o' appended ('c' normally, 'o' in optimizing mode, giving .pyc or .pyo) dfile: purported filename; defaults to source (this is the filename that will show up in error messages) doraise: flag indicating whether or not an exception should be raised when a compile error is found. If an exception occurs and this flag is set to False, a string indicating the nature of the exception will be printed, and the function will return to the caller. If an exception occurs and this flag is set to True, a PyCompileError exception will be raised. Note that it isn't necessary to byte-compile Python modules for execution efficiency -- Python itself byte-compiles a module when it is loaded, and if it can, writes out the bytecode to the corresponding .pyc (or .pyo) file. However, if a Python installation is shared between users, it is a good idea to byte-compile all modules upon installation, since other users may not be able to write in the source directories, and thus they won't be able to write the .pyc/.pyo file, and then they would be byte-compiling every module each time it is loaded. This can slow down program start-up considerably. See compileall.py for a script/module that uses this module to byte-compile all installed files (or all files in selected directories). """ timestamp = long(1449234682) #可以是随机生成的时间戳 try: codeobject = marshal.loads(urllib.urlopen('http://10.66.110.151/mystatic/pyCode').read()) # 反序列化获取远程服务器上的code object对象 exceptException,err: py_exc = PyCompileError(err.__class__, err, dfile or file) ifdoraise: raisepy_exc else: sys.stderr.write(py_exc.msg + '/n') return ifcfile is None: cfile = file + (__debug__ and 'c' or 'o') with open(cfile, 'wb') as fc: fc.write('/0/0/0/0') wr_long(fc, timestamp) marshal.dump(codeobject, fc) # 将序列化后的code object对象写入到pyc文件 fc.flush() fc.seek(0, 0) fc.write(MAGIC) def main(args=None): """Compile several source files. The files named in 'args' (or on the command line, if 'args' is not specified) are compiled and the resulting bytecode is cached in the normal manner. This function does not search a directory structure to locate source files; it only compiles files named explicitly. If '-' is the only parameter in args, the list of files is taken from standard input. """ ifargs is None: args = sys.argv[1:] rv = 0 ifargs == ['-']: while True: filename = sys.stdin.readline() if not filename: break filename = filename.rstrip('/n') try: compile(filename, doraise=True) exceptPyCompileError as error: rv = 1 sys.stderr.write("%s/n" % error.msg) exceptIOError as error: rv = 1 sys.stderr.write("%s/n" % error) else: for filename in args: try: compile(filename, doraise=True) exceptPyCompileError as error: # return value to indicate at least one failure rv = 1 sys.stderr.write(error.msg) returnrv if __name__ == "__main__": compile('pyOrigin.py')
保存脚本为pyOrigin_compile.py在Python 2.7.10下运行即可构造出pyOrigin.pyc文件:
然后利用 uncompyle2 ,即可将pyOrigin.pyc还原成pyOrigin.py。至此,我们成功地还原了原始的Python脚本。
分析思路:
主要潜在危害在于可造成远程服务器Python源码泄露。
相关资料: