micropython

Commit Graph

Author	SHA1	Message	Date
Damien George	19aee9438a	py/unicode: Clean up utf8 funcs and provide non-utf8 inline versions. This patch provides inline versions of the utf8 helper functions for the case when unicode is disabled (MICROPY_PY_BUILTINS_STR_UNICODE set to 0). This saves code size. The unichar_charlen function is also renamed to utf8_charlen to match the other utf8 helper functions, and the signature of this function is adjusted for consistency (const char* -> const byte*, mp_uint_t -> size_t).	2018-02-14 18:19:22 +11:00
Damien George	4601759bf5	py/objstr: Remove "make_qstr_if_not_already" arg from mp_obj_new_str. This patch simplifies the str creation API to favour the common case of creating a str object that is not forced to be interned. To force interning of a new str the new mp_obj_new_str_via_qstr function is added, and should only be used if warranted. Apart from simplifying the mp_obj_new_str function (and making it have the same signature as mp_obj_new_bytes), this patch also reduces code size by a bit (-16 bytes for bare-arm and roughly -40 bytes on the bare-metal archs).	2017-11-16 13:17:51 +11:00
Damien George	a3dc1b1957	all: Remove inclusion of internal py header files. Header files that are considered internal to the py core and should not normally be included directly are: py/nlr.h - internal nlr configuration and declarations py/bc0.h - contains bytecode macro definitions py/runtime0.h - contains basic runtime enums Instead, the top-level header files to include are one of: py/obj.h - includes runtime0.h and defines everything to use the mp_obj_t type py/runtime.h - includes mpstate.h and hence nlr.h, obj.h, runtime0.h, and defines everything to use the general runtime support functions Additional, specific headers (eg py/objlist.h) can be included if needed.	2017-10-04 12:37:50 +11:00
Damien George	58321dd985	all: Convert mp_uint_t to mp_unary_op_t/mp_binary_op_t where appropriate The unary-op/binary-op enums are already defined, and there are no arithmetic tricks used with these types, so it makes sense to use the correct enum type for arguments that take these values. It also reduces code size quite a bit for nan-boxing builds.	2017-08-29 13:16:30 +10:00
Javier Candeira	35a1fea90b	all: Raise exceptions via mp_raise_XXX - Changed: ValueError, TypeError, NotImplementedError - OSError invocations unchanged, because the corresponding utility function takes ints, not strings like the long form invocation. - OverflowError, IndexError and RuntimeError etc. not changed for now until we decide whether to add new utility functions.	2017-08-13 22:52:33 +10:00
Alexander Steffen	55f33240f3	all: Use the name MicroPython consistently in comments There were several different spellings of MicroPython present in comments, when there should be only one.	2017-07-31 18:35:40 +10:00
Damien George	48d867b4a6	all: Make more use of mp_raise_{msg,TypeError,ValueError} helpers.	2017-06-15 11:54:41 +10:00
Damien George	c88cfe165b	py: Use size_t as len argument and return type of mp_get_index. These values are used to compute memory addresses and so size_t is the more appropriate type to use.	2017-03-23 16:17:40 +11:00
Damien George	ae8d867586	py: Add iter_buf to getiter type method. Allows to iterate over the following without allocating on the heap: - tuple - list - string, bytes - bytearray, array - dict (not dict.keys, dict.values, dict.items) - set, frozenset Allows to call the following without heap memory: - all, any, min, max, sum TODO: still need to allocate stack memory in bytecode for iter_buf.	2017-02-16 18:38:06 +11:00
Damien George	c0d9500eee	py/objstr: Convert mp_uint_t to size_t (and use int) where appropriate.	2017-02-16 16:51:16 +11:00
Paul Sokolovsky	1563388001	py/objstr,objstrunicode: Fix inconsistent #if indentation.	2016-08-07 15:24:57 +03:00
Paul Sokolovsky	56eb25f049	py/objstr: Make .partition()/.rpartition() methods configurable. Default is disabled, enabled for unix port. Saves 600 bytes on x86.	2016-08-07 06:46:55 +03:00
Paul Sokolovsky	ed1c194ebf	py/objstrunicode: str_index_to_ptr: Implement positive indexing properly. Order out-of-bounds check, completion check, and increment in the right way.	2016-07-25 19:28:04 +03:00
Paul Sokolovsky	6af90b2972	py/objstrunicode: str_index_to_ptr: Should handle bytes too. There's single str_index_to_ptr() function, called for both bytes and unicode objects, so should handle each properly.	2016-07-25 14:45:08 +03:00
Dave Hylands	6a60fb3cf4	py/objstr*: Properly ifdef str.center().	2016-05-22 01:54:41 +03:00
Paul Sokolovsky	1b5abfcaae	py/objstr: Implement str.center(). Disabled by default, enabled in unix port. Need for this method easily pops up when working with text UI/reporting, and coding workalike manually again and again counter-productive.	2016-05-22 00:13:44 +03:00
Damien George	8212d97317	py: Use polymorphic iterator type where possible to reduce code size. Only types whose iterator instances still fit in 4 machine words have been changed to use the polymorphic iterator. Reduces Thumb2 arch code size by 264 bytes.	2016-01-03 16:27:55 +00:00
Damien George	999cedb90f	py: Wrap all obj-ptr conversions in MP_OBJ_TO_PTR/MP_OBJ_FROM_PTR. This allows the mp_obj_t type to be configured to something other than a pointer-sized primitive type. This patch also includes additional changes to allow the code to compile when sizeof(mp_uint_t) != sizeof(void*), such as using size_t instead of mp_uint_t, and various casts.	2015-11-29 14:25:35 +00:00
Damien George	cbf7674025	py: Add MP_ROM_* macros and mp_rom_* types and use them.	2015-11-29 14:25:04 +00:00
Paul Sokolovsky	1b586f3a73	py: Rename MP_BOOL() to mp_obj_new_bool() for consistency in naming.	2015-10-11 15:18:15 +03:00
Damien George	821b7f22fe	py: Use mp_not_implemented consistently for not implemented features.	2015-09-03 23:14:06 +01:00
Damien George	44e7cbf019	py: Clean up declarations of str type/funcs that are also in unicode. Background: trying to make an amalgamation of all the code gave some errors with redefined types and inconsistent use of static.	2015-05-17 16:44:24 +01:00
Damien George	7f9d1d6ab9	py: Overhaul and simplify printf/pfenv mechanism. Previous to this patch the printing mechanism was a bit of a tangled mess. This patch attempts to consolidate printing into one interface. All (non-debug) printing now uses the mp_print* family of functions, mainly mp_printf. All these functions take an mp_print_t structure as their first argument, and this structure defines the printing backend through the "print_strn" function of said structure. Printing from the uPy core can reach the platform-defined print code via two paths: either through mp_sys_stdout_obj (defined per port) in conjunction with mp_stream_write; or through the mp_plat_print structure which uses the MP_PLAT_PRINT_STRN macro to define how strings are printed on the platform. The former is only used when MICROPY_PY_IO is defined. With this new scheme printing is generally more efficient (less layers to go through, less arguments to pass), and, given an mp_print_t* structure, one can call mp_print_str for efficiency instead of mp_printf("%s", ...). Code size is also reduced by around 200 bytes on Thumb2 archs.	2015-04-16 14:30:16 +00:00
Damien George	0528c5a22a	py: In str unicode, str_subscr will never be passed a bytes object.	2015-04-04 19:42:03 +01:00
Paul Sokolovsky	ac2f7a7f6a	objstr: Add .splitlines() method. splitlines() occurs ~179 times in CPython3 standard library, so was deemed worthy to implement. The method has subtle semantic differences from just .split("\n"). It is also defined as working for any end-of-line combination, but this is currently not implemented - it works only with LF line-endings (which should be OK for text strings on any platforms, but not OK for bytes).	2015-04-04 00:09:48 +03:00
Damien George	2e2e404ff7	py: Allow to compile with extra warnings (sign-compare, unused-param).	2015-03-19 00:25:33 +00:00
Damien George	98e3a64694	py: Remove duplicated mp_obj_str_make_new function from objstrunicode.c.	2015-01-28 14:14:57 +00:00
Paul Sokolovsky	344e15b1ae	objstr: Remove code duplication and unbreak Windows build. There was really weird warning (promoted to error) when building Windows port. Exact cause is still unknown, but it uncovered another issue: 8-bit and unicode str_make_new implementations should be mutually exclusive, and not built at the same time. What we had is that bytes_decode() pulled 8-bit str_make_new() even for unicode build.	2015-01-23 02:15:56 +02:00
Paul Sokolovsky	6113eb2f33	objstr*: Use separate names for locals_dict of 8-bit and unicode str's. To somewhat unbreak -DSTATIC="" compile.	2015-01-23 02:05:58 +02:00
Damien George	0b9ee86133	py: Add mp_obj_new_str_from_vstr, and use it where relevant. This patch allows to reuse vstr memory when creating str/bytes object. This improves memory usage. Also saves code ROM: 128 bytes on stmhal, 92 bytes on bare-arm, and 88 bytes on unix x64.	2015-01-21 23:17:27 +00:00
Damien George	ff8dd3f486	py, unix: Allow to compile with -Wunused-parameter. See issue #699.	2015-01-20 12:47:20 +00:00
Damien George	51dfcb4bb7	py: Move to guarded includes, everywhere in py/ core. Addresses issue #1022.	2015-01-01 20:32:09 +00:00
Paul Sokolovsky	e62a0fe367	objstr: Allow to convert any buffer proto object to str. Original motivation is to support converting bytearrays, but easier to just support buffer protocol at all.	2014-10-31 00:03:53 +02:00
Damien George	cde0ca21bf	py: Simplify JSON str printing (while still conforming to JSON spec). The JSON specs are relatively flexible and allow us to use one function to print strings, be they ascii, bytes or utf-8 encoded.	2014-09-25 17:35:56 +01:00
Damien George	612045f53f	py: Add native json printing using existing print framework. Also add start of ujson module with dumps implemented. Enabled in unix and stmhal ports. Test passes on both.	2014-09-17 22:56:34 +01:00
Damien George	4abff7500f	py: Change uint to mp_uint_t in runtime.h, stackctrl.h, binary.h. Part of code cleanup, working towards resolving issue #50.	2014-08-30 14:59:21 +01:00
Damien George	ecc88e949c	Change some parts of the core API to use mp_uint_t instead of uint/int. Addressing issue #50, still some way to go yet.	2014-08-30 00:35:11 +01:00
Damien George	bb4c6f35c6	py: Make MP_OBJ_NEW_SMALL_INT cast arg to mp_int_t itself. Addresses issue #724.	2014-07-31 10:49:14 +01:00
Damien George	40f3c02682	Rename machine_(u)int_t to mp_(u)int_t. See discussion in issue #50.	2014-07-03 13:25:24 +01:00
Paul Sokolovsky	9e215fa4c2	py: Make unichar_charlen() accept/return machine_uint_t.	2014-06-28 23:15:29 +03:00
Damien George	e04a44e2f6	py: Small comments, name changes, use of machine_int_t.	2014-06-28 10:27:23 +01:00
Paul Sokolovsky	ea2c936c7e	objstrunicode: Refactor str_index_to_ptr() following objstr.	2014-06-27 00:04:20 +03:00
Paul Sokolovsky	00c904b47a	objstrunicode: Signedness issues.	2014-06-27 00:04:19 +03:00
Paul Sokolovsky	79b7fe2ee5	objstrunicode: Implement iterator.	2014-06-27 00:04:19 +03:00
Paul Sokolovsky	cdc020da4b	objstrunicode: Re-add buffer protocol back for now, required for io.StringIO.	2014-06-27 00:04:18 +03:00
Paul Sokolovsky	e7f2b4c875	objstrunicode: Revamp len() handling for unicode, and optimize bool().	2014-06-27 00:04:18 +03:00
Paul Sokolovsky	86d3898e70	objstrunicode: Get rid of bytes checking, it's separate type.	2014-06-27 00:04:18 +03:00
Paul Sokolovsky	9731912ccb	py: Prune unneeded code from objstrunicode, reuse code in objstr.	2014-06-27 00:04:18 +03:00
Chris Angelico	64b468d873	objstrunicode: Basic implementation of unicode handling. Squashed commit of the following: commit `99dc21b67a` Author: Chris Angelico <rosuav@gmail.com> Date: Thu Jun 12 02:18:54 2014 +1000 Optimize as per TODO (thanks Damien!) commit `5bf0153eca` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 08:42:06 2014 +1000 Test a default (= UTF-8) encode and decode commit `c962057ac3` Merge: `e2c9782` `195de32` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 05:23:03 2014 +1000 Merge branch 'master' into unicode, resolving conflict on py/obj.h commit `e2c9782a65` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 05:05:57 2014 +1000 More whitespace fixups commit `086a2a0f57` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 05:04:20 2014 +1000 Properly implement string slicing commit `0d339a143e` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 02:24:11 2014 +1000 Support slicing in str_index_to_ptr, and fix a bounds error commit `24371c7267` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 02:10:22 2014 +1000 Break out index-to-pointer calculation into a function commit `616c24ac01` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 02:03:11 2014 +1000 Add tests of string slicing, which currently fail commit `a24d19f676` Author: Chris Angelico <rosuav@gmail.com> Date: Tue Jun 10 01:56:53 2014 +1000 Change string indexing to not precalculate the charlen, and add test for neg indexing commit `0bcc7ab89e` Author: Chris Angelico <rosuav@gmail.com> Date: Sun Jun 8 22:09:17 2014 +1000 Clean up constant qstr declarations now that charlen isn't needed commit `5473e1a1db` Author: Chris Angelico <rosuav@gmail.com> Date: Sun Jun 8 07:18:42 2014 +1000 Remove the charlen field from strings, calculating it when required commit `5c1658ec71` Author: Chris Angelico <rosuav@gmail.com> Date: Sun Jun 8 07:11:27 2014 +1000 Get rid of mp_obj_str_get_data_len() which was used in only one place commit 
`a019ba968b` Author: Chris Angelico <rosuav@gmail.com> Date: Sun Jun 8 06:58:26 2014 +1000 Add a unichar_charlen() function to calculate length-in-characters from length-in-bytes commit `44b0d5cff8` Author: Chris Angelico <rosuav@gmail.com> Date: Sun Jun 8 06:32:44 2014 +1000 Use utf8_get/next_char in building up a string's repr commit `30d1bad33f` Author: Chris Angelico <rosuav@gmail.com> Date: Sun Jun 8 06:10:45 2014 +1000 Make utf8_get_char() and utf8_next_char() actually do what their names say commit `bc990dad9a` Author: Chris Angelico <rosuav@gmail.com> Date: Sun Jun 8 02:10:59 2014 +1000 Revert "Add PEP 393-flags to strings and stub usage." This reverts commit `c239f50952`. commit `f9bebb28ad` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 15:41:48 2014 +1000 Whitespace fixes commit `279de0c8eb` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 15:28:35 2014 +1000 Formatting/layout improvements - introduce macros for UTF-8 byte detection, add braces. No functional changes. 
commit `f1911f53d5` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 11:56:02 2014 +1000 Make chr() Unicode-aware commit `f51ad737b4` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 11:44:07 2014 +1000 Make a string's repr Unicode-aware commit `01bd686846` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 11:33:43 2014 +1000 Expand the Unicode tests commit `7bc91904f8` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 11:27:30 2014 +1000 Record byte lengths for byte strings commit `bb13212071` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 11:25:06 2014 +1000 Make ord() Unicode-aware commit `03f0cbe905` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 10:24:35 2014 +1000 Retain characters as UTF-8 encoded Unicode commit `e924659b85` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 08:37:27 2014 +1000 Add support for \u and \U escapes, but not \N (with explanatory comment) commit `231031ac5f` Author: Chris Angelico <rosuav@gmail.com> Date: Sat Jun 7 05:09:35 2014 +1000 Add character length to qstr commit `6df1b946fb` Author: Chris Angelico <rosuav@gmail.com> Date: Fri Jun 6 13:48:36 2014 +1000 Add test of UTF-8 encoded source file resulting in properly formed string commit `16429b81a8` Author: Chris Angelico <rosuav@gmail.com> Date: Fri Jun 6 13:44:15 2014 +1000 Make len(s) return character length (even though creation's still buggy) commit `cd2cf6663c` Author: Chris Angelico <rosuav@gmail.com> Date: Fri Jun 6 13:15:36 2014 +1000 HACK - When indexing a qstr, count its charlen. Stupidly inefficient but POC. All tests pass now, though string creation is still buggy. 
commit `47c234584d` Author: Chris Angelico <rosuav@gmail.com> Date: Fri Jun 6 13:15:32 2014 +1000 objstr: Record character length separately from byte length CAUTION: Buggy, may crash stuff - qstr needs equivalent functionality too commit `b0f41c72af` Author: Chris Angelico <rosuav@gmail.com> Date: Fri Jun 6 05:37:36 2014 +1000 Beginnings of UTF-8 support - construct strings from that many UTF-8-encoded chars, and subscript bytes the same way commit `89452be641` Author: Chris Angelico <rosuav@gmail.com> Date: Fri Jun 6 05:28:47 2014 +1000 Update comments - now aiming for UTF-8 rather than PEP 393 strings commit `c239f50952` Author: Chris Angelico <rosuav@gmail.com> Date: Wed Jun 4 05:28:12 2014 +1000 Add PEP 393-flags to strings and stub usage. The test suite all passes, but nothing has actually been changed.	2014-06-27 00:04:17 +03:00
Paul Sokolovsky	83865347db	objstrunicode: Complete copy of objstr, to be patched for unicode support.	2014-06-27 00:04:17 +03:00

50 Commits (36e474e83fa26cb78a9312dce5dc53a467c5d8b7)