Halite helpers #43

harrisse · 2020-06-03T23:06:00Z

REVIEWERS' NOTE: This PR is kind of a monster, apologies in advance. I've annotated each file with some commentary about what's worth reading and what's worth skipping. Large parts of the PR are boring refactors; I welcome comments on them, but they are not likely the best use of your time.

Feel free to skip any parts you don't feel are relevant to your interests. We are on a relatively tight timeline to get this out before Halite so if there are any larger pieces of feedback I'll likely save those for a future iteration. If you don't have time to read through this, then no worries and feel free to disregard the PR.

This PR provides wrapper classes for the halite environment and state as well as a convenient hook for the interpreter. This also reimplements the bulk of the original halite interpreter in the new board class, straightens out some logic inconsistencies in corner cases of the old implementation, and hopefully makes it a bit easier to parse what's happening in the code.

I tested the interpreter reimplementation by writing unit tests and by running actual episodes using the agents submitted to the shared repo. I can't validate that behavior is exactly the same but I can at least confirm that everyone's agents played complete and reasonably strategic-looking games that seemed similar or identical to the current implementation.

Fixed blank output lines on every call
Added a --out parameter for writing to a file while still getting stdout in the terminal
Misc style fixes as I saw them
Added some unit tests for halite

harrisse · 2020-06-11T07:31:14Z

kaggle_environments/envs/halite/halite.py

-
-    return board.action
+@board_agent
+def random_agent(board):


Random weights have been tweaked for readability, as random is meant to be an example for users, not a strong bot.

harrisse · 2020-06-11T07:32:08Z

kaggle_environments/envs/halite/halite.py



 agents = {"random": random_agent}


-def interpreter(state, env):
+def populate_board(state, env):


Everything in the populate_board method is the same as it was before, just refactored into its own method. Reviewers can skip to the next comment.

harrisse · 2020-06-11T07:35:01Z

kaggle_environments/envs/halite/halite.py

-                for uid, index in list(ships.items()):
-                    del ships[uid]
-                    del obs.players[index][2][uid]
+def interpreter(state, env):


Reviewers should continue reading here.

All observation update logic is now handled in the Board class. This includes applying actions, resolving collisions, gathering halite, depositing halite, and regenerating halite. Logic to eliminate players or otherwise control the meta-state of the game remains in this method.

harrisse · 2020-06-11T07:44:25Z

kaggle_environments/envs/halite/helpers.py

+THash = TypeVar('TComparable')
+
+
+def group_by(elements: Iterable[TElement], selector: Callable[[TElement], THash]) -> Dict[THash, List[TElement]]:


Needed to bring a friend in from Linq 😄
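For anyone who hasn't used LINQ's GroupBy, here is a minimal sketch of what a helper with this signature might do. Only the signature appears in the diff, so the body is an assumption, and the TypeVar names are chosen here to match their variables. The sample data is hypothetical.

```python
from typing import Callable, Dict, Iterable, List, TypeVar

TElement = TypeVar('TElement')
THash = TypeVar('THash')


def group_by(elements: Iterable[TElement], selector: Callable[[TElement], THash]) -> Dict[THash, List[TElement]]:
    """Groups elements into lists keyed by selector(element), keeping input order within each group."""
    results: Dict[THash, List[TElement]] = {}
    for element in elements:
        # setdefault creates the list the first time a key is seen.
        results.setdefault(selector(element), []).append(element)
    return results


# Hypothetical usage: group (ship_id, position) pairs by position to spot shared cells.
ships = [("a", (0, 0)), ("b", (1, 2)), ("c", (0, 0))]
by_position = group_by(ships, lambda ship: ship[1])
```

Any key with more than one element in by_position is a cell that several ships are trying to occupy.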

harrisse · 2020-06-11T07:50:54Z

kaggle_environments/envs/halite/helpers.py

+TValue = TypeVar('TValue')
+
+
+class ReadOnlyDict(Generic[TKey, TValue]):


Everything from here to the start of the Board class is mainly just data binding models. Reviewers can mostly skim these, but there are some small bits of logic, like ShipAction.to_point and Player.next_actions.

Please let me know though if you have suggestions or questions on the format, structure, or purpose of the binding models. I'd generally like to reuse this pattern for future competitions if the community gets good use out of it.

harrisse · 2020-06-11T07:53:39Z

kaggle_environments/envs/halite/helpers.py

+
+
+class Board:
+    def __init__(


The constructor is basically just used to deserialize the raw observation and configuration received from the halite interpreter into the richer Board model that we'll make use of in the next() method down below.

harrisse · 2020-06-11T07:54:33Z

kaggle_environments/envs/halite/helpers.py

+
+    @property
+    def observation(self) -> Dict[str, Any]:
+        """Converts a Board back to the normalized observation that constructed it."""


This is the observation serializer that pairs with the constructor's observation deserializer.

harrisse · 2020-06-11T07:55:48Z

kaggle_environments/envs/halite/helpers.py

+
+    def __deepcopy__(self, _) -> 'Board':
+        actions = [player.next_actions for player in self.players.values()]
+        return Board(self.observation, self.configuration, actions)


Because the transformation to an observation copies all of the data into new data structures, we can use it for deepcopy as well to isolate a copied instance from its parent.

❓ Could someone still mutate next_actions dictionaries though after this call? Should those be explicitly deepcopy'd as well?

next_actions is a computed property constructed from the player's ships and shipyards, so we don't have to worry about modification.
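To illustrate the deepcopy-via-serialization idea in isolation, here is a rough sketch using a hypothetical stand-in class (Snapshot is not part of the PR). It assumes, as the comment above describes for Board, that exporting to an observation always builds fresh containers.

```python
import copy
from typing import Any, Dict


class Snapshot:
    """Hypothetical stand-in for Board: built from a plain dict, exported back to a fresh dict."""

    def __init__(self, observation: Dict[str, Any]) -> None:
        # Copy on the way in so the instance never aliases the caller's data.
        self._halite = list(observation["halite"])
        self._step = observation["step"]

    @property
    def observation(self) -> Dict[str, Any]:
        # Builds brand-new containers on every call.
        return {"halite": list(self._halite), "step": self._step}

    def __deepcopy__(self, memo) -> "Snapshot":
        # Round-trip through the serialized form instead of copying field by field.
        return Snapshot(self.observation)


original = Snapshot({"halite": [10, 20, 30], "step": 0})
clone = copy.deepcopy(original)
clone._halite[0] = 999  # mutating the clone leaves the original untouched
```

The isolation only holds if every mutable field passes through the export, which is why the question about next_actions matters.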

harrisse · 2020-06-11T07:57:01Z

kaggle_environments/envs/halite/helpers.py

+        Player 1 is letter a/A
+        Player 2 is letter b/B
+        etc.
+        """


I'm curious if people think this belongs in __str__ or another method like print_board or something like that?

I think it's fine in __str__. It's easy to add an additional helper later. You could also argue that observations could be __repr__, but I think being explicit there makes sense because __repr__ is usually just a string.

harrisse · 2020-06-11T07:57:22Z

kaggle_environments/envs/halite/helpers.py

+            result += '|\n'
+        return result
+
+    def _add_ship(self: 'Board', ship: Ship) -> None:


Undocumented as these are meant to be internal only

harrisse · 2020-06-11T07:57:46Z

kaggle_environments/envs/halite/helpers.py

+        The current board is unmodified.
+        This can form a halite interpreter, e.g.
+            next_observation = Board(current_observation, configuration, actions).next.observation
+        """


This is the replacement for the observation update logic in the halite interpreter.

harrisse · 2020-06-11T07:58:58Z

kaggle_environments/main.py

@@ -63,6 +63,9 @@
 parser.add_argument(
    "--host", type=str, help="http-server Host (default=127.0.0.1)."
 )
+parser.add_argument(
+    "--out", type=str, help="Output file to write the results of the episode."


This allows you to write the results to an HTML file while still getting print statements in stdout.
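A minimal sketch of the pattern, assuming the flag simply redirects the episode result to a file. How main.py actually routes the output is not shown in this hunk, so build_parser and emit are hypothetical names.

```python
import argparse
from typing import List, Optional


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical stand-in for the real CLI parser; only --out is taken from the diff.
    parser = argparse.ArgumentParser(description="Sketch of an --out flag.")
    parser.add_argument(
        "--out", type=str, help="Output file to write the results of the episode."
    )
    return parser


def emit(result: str, argv: Optional[List[str]] = None) -> None:
    args = build_parser().parse_args(argv)
    if args.out is not None:
        # Results go to the file, leaving stdout free for agents' print statements.
        with open(args.out, "w", encoding="utf-8") as f:
            f.write(result)
    else:
        print(result)
```

Calling emit("<html>...</html>", ["--out", "episode.html"]) would write the file and print nothing.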

harrisse · 2020-06-12T06:42:58Z

kaggle_environments/envs/halite/test_halite.py

+
+
+def test_no_move_on_halite_gathers_halite():


There are lots of other scenarios we could test in here (like multiship collisions on shipyards, spawning collisions, etc) but I tried to cover most core rules with this pass.

harrisse · 2020-06-12T06:43:34Z

kaggle_environments/utils.py

-        print(buffer.getvalue())
-        if fallback != None:
+        output = buffer.getvalue()
+        if output:


This change gets rid of most of those annoying blank lines in the output
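For context, a sketch of why the guard helps: print("") still emits a newline, so echoing an empty capture buffer produces a blank line on every call. The function below is a hypothetical reduction of the pattern, not the code in utils.py.

```python
import io
from contextlib import redirect_stdout
from typing import Any, Callable


def run_and_echo(fn: Callable[[], Any]) -> Any:
    """Runs fn with stdout captured, then echoes the captured text only if there is any."""
    buffer = io.StringIO()
    with redirect_stdout(buffer):
        result = fn()
    output = buffer.getvalue()
    if output:
        # Without this guard, a silent callee would still cause a blank line.
        print(output)
    return result
```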

harrisse · 2020-06-12T06:44:13Z

kaggle_environments/utils.py

@@ -15,6 +15,7 @@
 import json


This file is all style and formatting updates; reviewers can likely skip it.

moserware

Exciting to see this launch soon! Looks pretty good. I left a few comments/suggestions.

If you haven't already, make sure to run pylint and mypy, and either fix the errors they report or explicitly ignore the ones you want to ignore with the appropriate comment, so that both pass without warnings.

Also, I'm assuming that agents can't exploit the runtime internal variables to allow cheating that otherwise wouldn't be possible?

moserware · 2020-06-12T13:38:20Z

kaggle_environments/envs/halite/halite.py

+    shipyards = me.shipyards
+    for shipyard in shipyards:
+        # 20% chance to spawn
+        shipyard.next_action = choice([ShipyardAction.SPAWN, None, None, None, None])


Using the weighted random.choices might be clearer here.
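Something along these lines, as a sketch. SPAWN is a plain-string stand-in for ShipyardAction.SPAWN so the snippet runs on its own, and pick_shipyard_action is a hypothetical helper name.

```python
from random import choices

SPAWN = "SPAWN"  # stand-in for ShipyardAction.SPAWN


def pick_shipyard_action():
    # 20% chance to spawn, 80% chance to do nothing.
    # choices() returns a list of k picks (k defaults to 1), so take the first.
    return choices([SPAWN, None], weights=[0.2, 0.8])[0]
```

This keeps the probabilities visible in one place instead of encoding them as repeated None entries.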

moserware · 2020-06-12T13:40:52Z

kaggle_environments/envs/halite/halite.py

+
+    actions = [agent.action for agent in state]
+    board = Board(obs, config, actions)
+    board = board.next()


Nice and concise 👏

moserware · 2020-06-12T13:49:49Z

kaggle_environments/envs/halite/helpers.py

+# region Helper Classes and Methods
+Point = NewType('Point', Tuple[int, int])
+
+


Could potentially be useful to have a comment and/or ASCII art to show the coordinate system.

moserware · 2020-06-12T13:56:51Z

kaggle_environments/envs/halite/helpers.py

+    See index_to_position for the inverse.
+    """
+    x, y = point
+    return (size - y - 1) * size + x


❓ This seems like a change in coordinate system from before? Did you intentionally want (0,0) to be at (size - 1) * size? Is that clearer than it being at 0?

This puts (0, 0) in the lower left corner and (size - 1, size - 1) in the upper right, which I personally found a lot more intuitive. Curious to get others' thoughts on this though.

Not sure I'm fully understanding, but my personal take is that (0, 0) as top left makes more sense because that's how 2d arrays work, and in particular the format used in the replay files. I see how your way is intuitive relative to normal XY plane but IMO it's better to match the programming methodology. But just a personal take and IIUC a lot / all of this is internal and the higher-level structure here means users aren't necessarily forced into either paradigm?

@dster2 Yeah users would only even notice the coordinate system if they were constructing offsets by hand. If they only used NORTH/SOUTH/EAST/WEST they'd likely never even notice there are coordinates. I'll consider this a bit more, thanks for the feedback all.
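To make the two conventions concrete, here is a small sketch of the mapping under discussion. The index formula is the one from the diff; the function name position_to_index and the body of index_to_position are assumptions (the diff only references index_to_position by name).

```python
from typing import Tuple

# Flat indices for size = 3, with (0, 0) in the lower left:
#
#   y=2 | 0 1 2
#   y=1 | 3 4 5
#   y=0 | 6 7 8
#       +------
#     x = 0 1 2


def position_to_index(point: Tuple[int, int], size: int) -> int:
    """Converts an (x, y) position to a row-major index whose row 0 is the top row."""
    x, y = point
    return (size - y - 1) * size + x


def index_to_position(index: int, size: int) -> Tuple[int, int]:
    """Inverse of position_to_index."""
    row, x = divmod(index, size)
    return (x, size - row - 1)
```

With this mapping the flat array still reads top to bottom like a 2D array, while (x, y) behaves like a normal XY plane.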

moserware · 2020-06-12T14:01:18Z

kaggle_environments/envs/halite/helpers.py

+        if isinstance(data, dict):
+            self._data = data
+        else:
+            # If it's not a Dict it must be a ReadOnlyDict based on our type


✋ Note that these are type hints and are not actually enforced by the runtime. You might want to check explicitly and raise if the check fails.

My goal here wasn't to validate that it must be a ReadOnlyDict, but to say that if the user passed in the correct type and it's not a dict, then it must be a ReadOnlyDict.
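For comparison, a sketch of what the explicit check could look like. Only the isinstance(data, dict) branch comes from the diff; the ReadOnlyDict branch, the TypeError, and the accessor methods are assumptions added to keep the example self-contained.

```python
from typing import Dict, Generic, Iterator, TypeVar, Union

TKey = TypeVar('TKey')
TValue = TypeVar('TValue')


class ReadOnlyDict(Generic[TKey, TValue]):
    def __init__(self, data: Union[Dict[TKey, TValue], 'ReadOnlyDict[TKey, TValue]']) -> None:
        if isinstance(data, dict):
            self._data = data
        elif isinstance(data, ReadOnlyDict):
            # Share the wrapped dict rather than nesting wrappers.
            self._data = data._data
        else:
            # Type hints are not enforced at runtime, so fail loudly here.
            raise TypeError(f"Expected dict or ReadOnlyDict, got {type(data).__name__}")

    def __getitem__(self, key: TKey) -> TValue:
        return self._data[key]

    def __iter__(self) -> Iterator[TKey]:
        return iter(self._data)

    def __len__(self) -> int:
        return len(self._data)
```

The trade-off is one extra isinstance call per construction in exchange for an error at the point of misuse rather than a later AttributeError.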

moserware · 2020-06-12T15:00:09Z

kaggle_environments/envs/halite/helpers.py

+
+            player._halite += leftover_convert_halite
+
+        def resolve_collision(ships: List[Ship]) -> Tuple[Optional[Ship], List[Ship]]:


These methods are specific to the game's rules and could be staticmethod? I wonder if it's worth factoring out to a separate class for clarity?

moserware · 2020-06-12T15:07:16Z

kaggle_environments/envs/halite/helpers.py

+                board._delete_shipyard(shipyard)
+                board._delete_ship(ship)
+
+        # Collect halite from cells into ships


❓ Just to confirm, if a smaller ship collides with a bigger ship, it should also collect in that same turn?

I'll double check the rules to make sure they're consistent with this behavior but I believe so
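As a reference point for this question, here is a rough sketch of a resolver with the same signature as resolve_collision. It assumes the rule that the ship carrying the least halite survives and that a tie for least destroys every ship involved; that rule is an assumption here and not confirmed by the diff. Ship is a simplified stand-in for the PR's model.

```python
from typing import List, NamedTuple, Optional, Tuple


class Ship(NamedTuple):
    # Simplified stand-in for the PR's Ship model.
    id: str
    halite: int


def resolve_collision(ships: List[Ship]) -> Tuple[Optional[Ship], List[Ship]]:
    """Returns (winner, losers) for a non-empty list of ships sharing a cell.

    The winner is None when two or more ships tie for the least halite.
    """
    if len(ships) == 1:
        return ships[0], []
    ordered = sorted(ships, key=lambda ship: ship.halite)
    if ordered[0].halite == ordered[1].halite:
        # Tie for the lightest ship: everyone is destroyed.
        return None, list(ships)
    return ordered[0], ordered[1:]
```

Under that assumption, the surviving ship is still on the cell when the collection step runs, which is the situation the question is asking about.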

moserware · 2020-06-12T15:10:04Z

kaggle_environments/envs/halite/helpers.py

+            delta_halite = int(cell.halite * configuration.collect_rate)
+            if cell.shipyard_id is None and delta_halite > 0:
+                ship._halite += delta_halite
+                cell._halite -= delta_halite


Might want another assert cell._halite > 0 after this? Just to confirm, a cell cannot run out of halite?
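A quick way to reason about this is to pull the arithmetic out into a standalone function. The sketch below assumes integer halite values and a collect rate strictly between 0 and 1, and it omits the shipyard check from the diff. The collect helper and the 0.25 rate are illustrative only.

```python
from typing import Tuple


def collect(cell_halite: int, ship_halite: int, collect_rate: float) -> Tuple[int, int]:
    """Returns (cell_halite, ship_halite) after one collection step."""
    # int() truncates, so delta_halite <= cell_halite * collect_rate < cell_halite.
    delta_halite = int(cell_halite * collect_rate)
    if delta_halite > 0:
        ship_halite += delta_halite
        cell_halite -= delta_halite
    return cell_halite, ship_halite


rich = collect(100, 0, 0.25)  # a quarter of the cell moves to the ship
poor = collect(3, 5, 0.25)    # int(0.75) == 0, so nothing moves
```

Under those assumptions a cell that starts above zero stays above zero, because truncation always leaves at least part of the remainder behind.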

moserware · 2020-06-12T15:12:00Z

kaggle_environments/envs/halite/helpers.py

+    def agent_wrapper(obs, config) -> Dict[str, str]:
+        board = Board(obs, config)
+        agent(board)
+        return board.current_player.next_actions


Am assuming the JSON type/domain checking is already verifying these actions are valid.

These get filtered out and ignored in the board constructor if they're not valid

moserware · 2020-06-12T15:18:07Z

kaggle_environments/utils.py

-        print(buffer.getvalue())
-        if fallback != None:
+        output = buffer.getvalue()
+        if output:


harrisse · 2020-06-12T20:08:44Z

@moserware Re: internal-only methods and exploits: no, bots can't use them unless users are running untrusted code locally (which we ultimately can't protect against). In production we isolate agents in different containers, so there's no risk of them being able to call each other's methods.

harrisse added 12 commits June 2, 2020 20:20

Started building out halite helper classes

6693070

Finished initial pass at helper classes for Halite

62fe6d5

Removed Point class, cleaned up board population, and added board pri…

deea1c8

…nting

Added more convenience properties for Halite, filled in some comments…

ac0f18e

… for the Board class

Added basic collision detection and action formation logic to helper …

562d241

…code

General helper cleanup, fixed a bug with serializing shipyard actions…

bf52ab8

…, and added support for exporting a board to an observation

Calculate remaining halite as actions occur, add unprotected action s…

091e6b8

…etters

Move to board-based interpreter and remove collision prevention in fa…

99b90a2

…vor of collision simulation

Made raw Board constructor hidden

6a7d1a7

More progress in board <-> interpreter refactor and cleanup

64f644b

Merge branch 'master' into halite-helpers

e430a67

Commented up halite, fixed a couple small halite bugs

ea4dafd

harrisse commented Jun 11, 2020

Style updates

8a2dc09

harrisse commented Jun 11, 2020

Swapped north and south, handle step updates in Board

bec5a5b

harrisse commented Jun 12, 2020

harrisse requested a review from alexisbcook June 12, 2020 06:47

harrisse requested review from moserware, myles-oneill, pculliton and dster2 and removed request for myles-oneill and alexisbcook June 12, 2020 06:47

Made non-top-level .observation properties private

7b6fbb9

moserware reviewed Jun 12, 2020

harrisse added 6 commits June 12, 2020 14:04

Reorder halite rules

d027e91

Deposit halite after collision

1ae58cb

Readd move costs, handle depositing halite after collision, rank play…

69a0c6e

…ers by elimination order

PR updates

781cf62

Fix random bot

228deb6

Rewrote random agent

7cc6a37

harrisse merged commit 06bd76f into master Jun 12, 2020

johan-gras added a commit to johan-gras/kaggle-environments that referenced this pull request Dec 15, 2020

Fix env.render("human")

d0c427f

Human rendering was not working anymore since Kaggle#43

johan-gras mentioned this pull request Dec 15, 2020

Fix env.render(mode="human") #120

Open
